BEST AVAILABLE CO^Y 



GenCore vara ion 5.1.9 
Copyright (c) 1993 - 2006 Bioccel oration Ltd. 



OH protein - protein search, using aw model 



Title: 

Perfect ecore: 
Sequence ; 



June 27, 2006, 01:46:32 ; Search time 176 Seconds 
(without alignments) 
1446.696 Million cell updates/sec 

US-10-6B9-S76-1 
2941 

1 HARTTSQL YDAVF I QSSWL RKI QEWRQRYQSH PDAAVQ 556 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 2097797 seqs, 4632148SB residues 

Total number of hits satisfying chosen parameters: 2097797 

Minimum DB seq length: 0 

Maximum DB esq length: 2000000000 

Poet -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applicetione_AA_Main : • 

1 : /EJ4C_Celerra_SIDS3/ptodata/2/pubpaa/US07_PUBCOMB.pep: • 

2: /EMC~Celerra~SIDS3/ptodata/2/pubpaa/US0B~PUBCCKB.pep:* 

3 : /EMC_Celerra~SIDS3/ptodata/2/pubpaa/US09_PUBCOMB.pep: • 

4 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US10A_[>UBCOMB.pep: • 

5: /EMC_Colorra_SIDS3/ptodata/2/pubpaa/US10B_PUBCOMB.pep:« 

6 : /SMC~Celerra~SIDS3/ptodata/2/pubpaa/USll_PUBCOMB.pep: • 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Score 



Query 

Match Length DB 



2941 
2941 
2941 
2941 
2941 
2941 
2941 
2941 
2941 
2941 
2941 



100.0 
100.0 
100.0 
100.0 
100.0 
100.0 
100.0 
100.0 
100.0 
100.0 
100.0 



556 
556 
556 
556 
556 
556 
556 
556 
556 
SS6 
556 



09-771- 
i-10-190- 
-10-376 
-10-704- 
-10-473- 
-10-473- 
-10-473 
-10-473- 
-10-473- 
-10-473- 
-10-473- 



161A-245 

012-1 

554-6 

921-14 

127-1372 

127-1373 

127-1374 

127-1377 

127-1378 

127-1380 

127-1382 



Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 



Sequence 
Sequence 



245, App 
1. Appli 
6, Appli 
14, Appl 

1372, Ap 

1373, Ap 

1374, Ap 

1377. Ap 

1378, Ap 
1380, Ap 
1382, Ap 



2941 


100.0 


S56 


5 


US -10 -473 -127- 1383 


Sequence 1383, Ap 


2941 


100.0 


556 


5 


US -10 -473 -127- 1384 


Sequence 1364, Ap 


2941 


100.0 


556 


5 


US-10-723-860-20S3 


Sequence 2053, Ap 


2941 


100.0 


556 


5 


US - 10 - 669 - 576 • 1 




2941 


100. 0 


556 


s 


US -10 -509- 599- 28 


Sequence 28, Appl 


2838 


96. 5 


535 


5 


US -10 -473 -127- 1379 


Sequence 1379, Ap 


2682 


91. 2 


506 


5 


US -10 -473 -127-1381 


Sequence 1381, Ap 


2202.5 


74. 9 


468 


5 


US-10-473 -127-1371 


Sequence 1371, Ap 


2202.5 


74. 9 


468 


5 


US -10 -473 -127-1385 


Sequence 1385, Ap 


1900 


64. 6 


361 


4 


US -10 -664 -421-106 


Sequence 106, App 


1900 


64 .6 


361 


5 


US-10-941-635-107 


Sequence 107, App 


1710 


58. 1 


319 


5 


US-10-473 -127-1375 


Sequence 1375, Ap 


1710 


58. 1 


319 


5 


US-10-473 -127-1386 


Sequence 1386, Ap 


I486 


50. 5 


285 


4 


US-10-317-574-13 


Sequence 13, Appl 


1486 


50. 5 


285 


4 


US-10-217-555-13 


Sequence 13, Appl 


1227 


41.7 


238 


3 


US-09-771-161A-154 


Sequence 154, App 


1146.5 


39.0 


752 


4 


US-10-108-60S-133 


Sequence 133, App 


1144.5 


38. 9 


752 


4 


US-10-190-012-29 


Sequence 29, Appl 


1144.5 


38. 9 


752 


5 


US-10-689-576-29 


Sequence 29, Appl 


1144.5 


38.9 


752 


6 


US-11-097-143-13B 


Sequence 138, App 


1144 . 5 


38 . 9 


752 


6 


US- 11 -097 -143 -573 


Sequence 573 , App 


1144 . 5 


38 . 9 


7S2 


6 


US- 11 -097- 143-28173 


Sequence 26173, A 


1144 . 5 


38.9 


755 


6 


US- 11 -097 -143-28170 


Sequence 28170, A 


973 


33.1 


539 


6 


US- 11 -097 -143-28176 


Sequence 28176, A 


973 


33 . 1 


539 


£ 


US- 11 -097 - 143 -28179 




940 


32.0 


636 


3 


US-09-205-658-160 


Sequence 160, App 


940 


32.0 


636 


3 


US-09-963-693-160 


Sequence 160, App 


935 


31.8 


632 


3 


US-09-205-658-1S9 


Sequence 159, App 


935 


31.8 


632 


3 


US-09-963-693-159 


Sequence 159, App 


917.5 


31.2 


473 


4 


US-10-369-493-13231 


Sequence 13231, A 


875.5 


29.8 


537 


4 


US-10-425-114-64601 


Sequence 64801, A 


672.5 


29.7 


537 


4 


US-10-425-114-67015 


Sequence 67015, A 


870.5 


29.6 


504 


4 


US-10-42S-11S-292727 


Sequence 292727, 


846. 5 


28.8 


628 


4 


US-10-369-493-22776 


Sequence 22776, A 



ALIGNMENTS 



RESULT 1 

US-09-771-161A-245 

Sequence 245, Application US/09771 16 1A 
Patent No. US2O020110811A1 
GENERAL INFORMATION: 
APPLICANT: LEV INK, et al. 

TITLE OP INVENTION: VARIANTS Of PROTEIN KINASES 

PILE REFERENCE : 802620-2005.1 

CURRENT APPLICATION NUMBER: US/09/771, 161A 

CURRENT FILING DATE: 2001-01-26 

PRIOR APPLICATION NUMBER: 09/724,676 

PRIOR FILINO DATE: 2000-11-28 

PRIOR APPLICATION NUMBER: 136776 

PRIOR FILING OATS: 2000-06-15 

PRIOR APPLICATION NUMBER: 135619 

PRIOR PILING DATE: 2000-04-12 

NUMBER OF SEQ ID NOS : 273 

SOPTWARB: Patentln version 3.0 



; SEQ ID NO 245 
; LENGTH: 556 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-771-161A-245 

Query Match 100.0%; Score 2941; DB 3; Length 556; 

Best Local Similarity 100.0%; Pred. No. 9.2e-212; 

Matches 556; Conservative 0; Mismatches 0; Indole 0 



Oy 


l 


Db 


l 


oy 


61 


Db 


61 


or 


121 


Db 


121 


Qy 


181 


Db 


181 


Oy 


241 


Db 


241 


Oy 


301 


Db 


301 


Oy 


361 


Db 


361 


Oy 


421 


Db 


421 


Oy 


481 


Db 


481 


Oy 


541 


Db 


541 



llllllllllllillllilllllllllllillltlillllllllllllllllllllllll 



MM I III I III III I III !ll I III I Mil III I III III I 111 I Ml III I III I III 



1 1 ii 1 1 M 1 1 1 1 1 1 1 j 1 1 1 1 1 1 m 1 1 1 ii 1 1 1 1 1 1 1 r 1 1 r 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 



IIMIIMIIII III Mill III IMIMI I III I IIMIM MM III III I III I III 



IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIilllMIIIIIIIIIIIIII 



MMMMMII III MM I IMMI MM I III I IMIMMMIMI III I IIMIM 



1 1 1 1 F M 1 1 1 1 1 M I II M 1 1 J 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 f M 1 1 1 1 II 



MM I IIMMMII I III MM I IMMI I IIMIM III I III I MM MMIMIII 



lllllllllllllllllllllillllllllllllllllllllllllllllllllMIIII 

)PSGNAHKWCRKIQ 540 



lllllllllllllll 



RESULT 2 
US-10-190-012-1 

; Sequence 1, Application US/10190012 
; Publication No. US20030108971A1 
GENERAL INFORMATION: 

APPLICANT : Alessl, Dario R 



TITLE OF INVENTION: ENZYME 
NUMBER OP SEQUENCES: 35 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Jaeckle Fleischmann & Mugel, LLP 
STREET : 39 State Street 
CITY: Rochester 
STATE: New York 
COUNTRY: USA 
ZIP: 14614-1310 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Ploppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/190,012 
FILING DATE: 05-Jul-2002 
CLASSIFICATION: < Unknown > 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/08/943,667 
FILING DATE: 03-OCT-1997 
APPLICATION NUMBER: GB 9705462.1 
PILING DATE: 17-MAR-1997 
APPLICATION NUMBER: GB 9712826.8 
PILING DATE: 19-JUN-1997 
APPLICATION NUMBER: GB 9717253.0 
FILING DATE: 15-AUG-1997 
ATTORNEY / AGENT INFORMATION: 
NAME : Braman, Susan J 
REGISTRATION NUMBER: 34,103 
REFERENCE/ DOCKET NUMBER: 87792. 97R421 
TELECOMMUNICATION INFORMATION: 
TELEPHONE : 716-262-3640 
TELEFAX: 716-262-4133 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 556 amino acids 
TYPE: amino acid 
STRAND ED NESS : single 
TOPOLOGY: linear 
MOLECULE TYPB: protein 
HYPOTHETICAL: NO 
ANT I -SENSE : NO 
ORIGINAL SOURCE : 

ORGANISM: Homo sapiens 
; SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

US-10-190-012-1 



Query Match 100.0%; Score 2941; DB 4; Length 556; 

Best Local Similarity 100.0%; Pred. No. 9.2e-212; 

Hatches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
ry 1 MARTTSQLYDAVPIQSSVVLCSCPSPSMVRTQTBSSTPPG I PGGSRQGPAMDGTAABPRP 60 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

to 1 MARTTSQL YDAVP I QSSWLCSCPSPSMVRTQTBSSTPPG I PGGSRQOPAKDOTAAEPRP 60 

ry 61 GAGSLQHAQPPPQPRKKRPEDFKFG KI LGEGSPSTWLARSLATSRBYA I KI LB KRH 1 1 K 120 



Db 


61 


oy 


131 


Db 


121 


oy 


181 


Db 


181 


oy 


241 


Db 


241 


oy 


301 


Db 


301 


Oy 


361 


Db 


361 


Oy 


431 


Db 


421 


Oy 


481 


Db 


481 


oy 


541 


Db 


541 



in in i nun i iii i iii iiiiiiiiiii iii i iii in i mm ii i mi iii iii 



inn i iiiini in i in iiiiiiiiiii in i in in i in i in ii 



Mill 



lllllllllllllllllllltllllllllilllllllllllllllllllllllllllll 



llllllllllllltll Illlllillltlllllllllllllfllllllllllll 



iimii i iiiini mini iiiiiiiiiii iii i iii iii i iiiini mi ii i iii 



iiiini mini mini mi iinni iiiiiii m i mi in mi in in 



lllllllllllllllllllllllllllttllllllllMIIIIIIIIIIIIIIIIIIIII 



llllllllllllllllllMlllfllllltllllllllllllllllllflllllllllll 



llllllllllllllll 



RESULT 3 
US-10-376-554-6 
; Sequence 6 



Publication No. 



Application US/10376554 
US20O3021S849A1 
; GENERAL INFORMATION: 

APPLICANT: EXELIXIS, INC. 
TITLE OF INVENTION: PDPKls AS MODIFIERS OF THE p53 PATHWAY AND METHODS OF USE 
FILE REFERENCE: EX03-017C 

CURRENT APPLICATION NUMBER: US/10/376,554 
CURRENT FILING DATE: 2003-02-28 
PRIOR APPLICATION NUMBER: US 60/361,196 
PRIOR FILING DATE: 2002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE: Patentln version 3.3 
SEQ ID NO 6 
LENGTH: 556 



180 


Oy 


1 


1B0 


Db 


1 


240 


Oy 


61 


240 


Db 


61 


300 


oy 


121 


300 


Db 


121 


360 


Oy 


181 


360 


Db 


181 


420 


oy 


241 


420 


Db 


241 


480 


oy 


301 


480 


Db 


301 


540 


oy 


361 


540 


Db 


361 




oy 


421 




Db 


421 




Oy 


481 




Db 


481 




Oy 


541 




Db 


541 



ORGANISM: Hoi 
US-10-376-554-6 



> aapiena 



Beat Local Similarity 100.0%; Pred. No. 9.2e-213; 
Matchea S56; Conaervative 0; Mismatches 0; Indole 



lllittlllllllllllllllllilltllllllllllllllllllllllllllllllltl 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 



iiiiiiiiiiiiiiiiiiiiiiiiiiiitiiiiiitiiitiiiitiitiiiiiiiiiiii 



inn i inmiiii iiiiiiiiimnini NiiiNi ii i iiiiiii mi mi 



urn mmi i mi minim m minimi i in iiiiiiimimi 



inn i iimii mi iniiiiiimnnii mini i ii iiimn in i in i 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiitiiiiiiiiiiiiiiiiiiiiiiiniiiii 



tiitiiiiiiiiiiitiiitiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiii 



inn i in mi in in i m i n mi i mm i m i ii i mi iiiiiiiiiii 



llllllllllllllll 



100.0%; Score 2941; DB 4; Length 556; 



RESULT 4 

US-10-704-921-14 

Sequence 14, Application US/10704921 
Publication No. US20040166099A1 
GENERAL INFORMATION: 
APPLICANT: Rao, Patricia 

TITLE OF INVENTION: MOLECULES PREFERENTIALLY ASSOCIATED WITH EFFECTOR T CELLS 
TITLE OF INVENTION: AND METHODS OF THEIR USE 
FILE REFERENCE : TLN-026CP 

CURRENT APPLICATION NUMBER: US/10/704,921 
CURRENT FILING DATE: 2003-11-10 
PRIOR APPLICATION NUMBER: 60/467477 
PRIOR FILING DATE: 2003-05-02 



PRIOR APPLICATION NUMBER ; 60/424777 
PRIOR FILING DATE: 2002-11-08 
NUMBER OF SEQ ID NOS: 15 

SOFTWARE: FastSEQ for Windowa Veraion 4.0 
SEQ ID NO 14 
LENGTH: 556 
TYPE: PRT 
; ORGANISM: Homo aapiens 
US-10-704-921-14 



Query Match 100.0%; 
Best Local Similarity 100.0%; 
Matches SS6; Conaervative 0 



Score 2941; DB 4 ; 
Pred. No. 9.2o-212 
Mismatches 0; 



Length 556; 



Oy 1 MARTTSQLYDAVPIOSSWLCSCPSPSMVRTQTESSTPPGI PGGSRQGPAMDOTAAEPRP 60 

II I I I II I I II II I II I I I t I I I I I I I I I I III I I I I I I I I I I I I I I I I I II I I II II I I 
Db 1 MARTTSQLYDAVPIQSSVVLCSCPSPSMVRTQTESSTPPGIPGGSRQGPAMDGTAAEPRP 60 

Oy 61 GAGS LOHAQ P P PQPRKKRP EDF KFGKI LGEG S FSTWLARELATSREYA I K I L E KRH 1 1 K 120 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 61 GAGS LQHAQP P PQPRKKR PEDFKFGKI LGEGS FSTWLARELATS RE YA I K I LE KRH 1 1 K 120 

Oy 121 BNKVPYVTRERDVMSRliM P FFVKLYFT FQDDE KL Y FGLSYAKNG ELLKY I RKI G S FDET 180 

iniimiinniinimimiiiini ninmiii niimiiniiiii 

0b 121 BNKVP YVTRE RDVMSRLDH P FFVKLYFTFQDDE KL Y FGLSYAKNGELLKY IRKIGSFDET 180 

Oy 181 CTRFYTAEIVSALEYLHGKQI IHRDLKPENILLNBDMHIQITDFGTAKVLSPE5KQARAN 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 CTRFYTAEIVSALEYLHGKGI IHRDLKPENILLNEDMHIQITDFOTAKVLSPBSKQARAN 240 



Db 


241 


S FVGTAQYVS P 5 LLTEKSACKSSD LWALGC I IYQLVAGLPPFRAGNEYLIFQKI IKLEYD 300 


oy 


l 


Oy 


301 


FPBKFFPKARDLVEIOXVLDATKRLGCEEMEGYGPLKAHPFFESVTWENLHCOT 360 


Db 


l 






IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 






Db 


301 


FPBKFFPKARDLV^IOXVI^TKRLGCTEMEGYGPLKAHPFFESVTWENLH 360 


Oy 


61 


Oy 


361 


YLPAMSEDD EDCYGNYDNLLSQFGCMQVS SSSSSHSLSASDTGLPQRSGSN I BQY I HDLO 420 


Db 


61 


0b 


361 


YLPAMSEDDEDCYGNY^^ 420 






oy 


421 


SNSFELDLQFSEDE1GILLLEKQAGGNPWHQFVENNLI LKMGPVDKRKGLFARRRQLLLTB 480 


Oy 

Db 


121 
121 






iiiiiiiiiiiiimmiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiii 






Db 


421 


SNSFELOLQFSEDEKRIXLEKQAGGNPWHQFVENNLI LKMGPVDKRKGLFARRRQLLLTB 480 


Oy 


1B1 


Oy 


481 GPHL YYVDPVNKVLKGE I PWSQBLRP EAKNFKTFFVHTPNRTYYLMD PSGNAH KWCRKI Q 540 


Db 


1B1 






IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHIIIIIII 






ob 


461 G PHLYYVDPVNKVLKGE I PWSQBLRPEAKNFKTFFVHTPNRTYYLMD PSGNAHKWCRKI Q 540 


oy 


241 


Oy 


541 


BVWRQRYQSHPDAAVQ S56 


Db 


241 






llllllllllllllll 






Db 


541 


EVWRQR YQSH PDAAVQ 556 


Oy 


301 








Db 


301 


RESULT S 










US-10-473 


-127 


1372 


Oy 


361 



1372, Application US/10473127 
; Publication No. US20040236091A1 
; GENERAL INFORMATION: 
APPLICANT: Zycoa Inc. 

TITLE OP INVENTION: TRANSLATIONS PROFILING 
FILE REFERENCE: 08191 -026WO1 
CURRENT APPLICATION NUMBER: US/10/473,127 
CURRENT FILING DATE: 3003-09-26 
PRIOR APPLICATION NUMBER: 60/279,495 
PRIOR FILING DATE : 2001-03-38 
PRIOR APPLICATION NUMBER: 60/392,544 
PRIOR FILING DATE: 2001-05-21 
PRIOR APPLICATION NUMBER: 60/310,801 
PRIOR FILING DATE: 2001-08-08 
PRIOR APPLICATION NUMBER: 60/326,370 
PRIOR PILING DATE: 2001-10-01 
PRIOR APPLICATION NUMBER: 60/336,780 
PRIOR FILING DATS: 2001-13-04 
PRIOR APPLICATION NUMBER: 60/356,985 
PRIOR FILING DATE: 2002-02-20 
NUMBER OP SEQ ID NOS: 2041 
SOFTWARE : FaatSEQ for Windowa Veraion 4.0 
SEQ ID NO 1372 
LENGTH: 556 
TYPE: PRT 

ORGANISM: Horn aapiena 
US-10-473-127-1372 

Query Match 100.0%; Score 2941; DB 5; Length 556; 

Best Local Similarity 100.0%; Pred. No. 9.2e-212; 

556; Conaervative 0; Mismatches 0; Indole 0; 



lllllllllllllllllllllllllllllllllllllllllllimilllllllllll 



lllllllllllllllllllllllillllllllllllllllllllllllllllllllllll 
GAGSLQHAQPPPQPRKKRPBDFKFGKI LGEGS PSTWLARELATSREYA I KI LEKRH 1 1 K 120 

ENKVPYVTRBRIRMSBliJHP FFVKLYFTFQDDE IG,^ 180 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 



iiiiiiimiiiimiiiiii iiiiiiiiiiiiiiiiiiiiiiiiiiiniii 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiiiiii 



iiiiimiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiii 

FPBKFFP KARDLVB KLLVLDATKRLGCSEMEGYGPLKAHP PFBSVTWBNLHQQTP PKLTA 360 
361 YLPAMSEDDEDCTGNYONLLSQFGCMCJVSSSSSSHSLSASDTGLPQRSOSNI EQY I HOLD 420 



MUM llllillllll llll llll III lllllllllll III llll Ml MM lllllll 



Ob 361 YI^PAMS EDOEOCYGNYONUtSQFGQfQVSSSSSSHSLSASDTGLPQRSGSN I EQY I KDLO 420 


Qy 


121 


OV 421 SNSFELOLOFSEDSKRLLLEKQAOGNPWHQPVENNL I LKUGPVDKRKGLFARRRQLLLTE 480 


0b 


121 


IIIIIIIIIIIIIIMIIIIIItlllllllillllllll 


linn 


lllllll 


lllllll 






Ob 421 SNSFBLDLQF5EDBKRLLLHKQAGGNPWH0FVENNLILK 






IRQLLLTB 480 


Qy 


181 


OV 4B1 GPHLYYVDPVNKVLKGBIPWSQSLRPKAKNFKTFFVHTP 




MDPSGNAH KWCR K I Q 540 


Db 


181 


iiiiiiiiiiiiiiiiiiiiiiininiitiiniiiiiiiiit 


imimiiiini 






Db 481 GPHLYYVDPWKVLKCEIPWSQELRPEAKNFKTFFVHTP 


NRTYYL 


MDPSGNAH KKCRKIQ 540 


Qy 


241 


Oy 541 EVWRQR YQSH PDAAVQ 556 








Db 


241 


i i i ■ i i i i i i i i i i i i 
II 1 1 II 1 1 II 1 1 1 1 1 1 












Db 541 EVWRQR YQSH PDAAVQ SS6 








Qy 


301 










Db 


301 


RESULT 6 












US -10 -473 -127-1373 








Qy 


361 


; Sequence 1373. Application US/10473127 












; Publication No. US20040236091A1 








Db 


361 


; GENERAL INFORMATION: 












; APPLICANT: Zycos Inc. 








Qy 


431 


; TITLE OP INVENTION: TRANSLATIONS PROFILING 












; PILE REFERENCE: 08191-026WO1 








Db 


421 


; CURRENT APPLICATION NUMBER: US/ 10/4 73. 127 












; CURRENT FILING DATE: 2003-09-26 








Qy 


481 


; PRIOR APPLICATION NUMBER: 60/279.495 












j PRIOR FILING DATE: 2001-03-28 








Db 


4B1 


; PRIOR APPLICATION NUMBER : 60/292.544 












; PRIOR FILING DATS : 2001-05-21 








Qy 


541 


; PRIOR APPLICATION NUMBER: 60/310,801 










; PRIOR FILING DATE: 2001-08-08 








Db 


541 


; PRIOR APPLICATION NUMBER : 60/326.370 












; PRIOR PILING DATE: 2001-10-01 












; PRIOR APPLICATION NUMBER: 60/336,760 








RESULT 7 




; PRIOR FILING DATE: 2001-12-04 








US-10-473 


-127- 



PRIOR APPLICATION NUMBER ; 60/3S8.98S 
PRIOR FILING DATE: 2002-02-20 
NUMBER OF SEQ ID NOS : 2041 
; SOFTWARE: FaotSEQ for Windows Version 4.0 
; SEQ ID NO 1373 
; LENGTH: SS6 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-473-127-1373 

Query Match 100.0%; Score 2941; DB 5; Length 556; 

Beat Local Similarity 100.0%; Pred. No. 9.2e-212; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Oy 1 MARTTSQL YDAVP I QSSWLCS CPS P SMVRTQTES STPPG I PGGSRQGPAMDGTAAB PRP 60 

llllllllllllllllllllllllllllimilllllllllllllllllllllllllll 
Db 1 MARTTSQLYDAVP I QSSWLCS CPS P SMVRTQTES STPPG I PGGS RQGPAMDGTAAE PRP 60 

Qy 61 GAGSLCHAQPPPQPRKKRPEDFKFGKI LGEGSFSTWLARELATSREYAI KILEKRH 1 1 K 120 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 
Db 61 GAGSLQHAQPPPQPRKKRPEDFKFGKI LGEGSFSTWLARELATEREYA I KI LEKRH 1 1 K 120 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
" " " - ' ~ ~ .YFTFQDDSKLYFGLSYAmSBLLKYIRKIGSFDBT 180 



IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 



IIIIMIM IIIIMillllllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




> I LKX3 PVDKHKGLFARRRQLLLTB 480 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 



IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



llllllllllllllll 



Sequence 1374, Application US/10473127 
Publication No. US20040236091A1 
GENERAL INFORMATION: 
APPLICANT: Zycoa Inc. 

TITLE OF INVENTION: TRANSLATIONAL PROFILING 

FILE REFERENCE: 08191 -026WO1 

CURRENT APPLICATION NUMBER: US/10/473,127 

CURRENT FILING DATE: 2003-09-26 

PRIOR APPLICATION NUMBER: 60/279,495 

PRIOR FILING DATE: 2001-03-28 

PRIOR APPLICATION NUMBER: 60/292,544 

PRIOR FILING DATE: 2001-05-21 

PRIOR APPLICATION NUMBER: 60/310.801 

PRIOR FILING DATE: 2001-08-00 

PRIOR APPLICATION NUMBER: 60/326,370 

PRIOR FILING DATE: 2001-10-01 

PRIOR APPLICATION NUMBER: 60/336,780 

PRIOR FILING DATE: 2001-12-04 

PRIOR APPLICATION NUMBER: 60/358,985 

PRIOR FILING DATE: 2002-02-20 

NUMBER OP SEQ ID NOS: 2041 



; SOFTWARE: FaatSEQ tor Windows Version 4.0 
; SEO ID NO 1374 

LENGTH: 556 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-473-127-1374 

Query Match 100.0%; Score 2941; DB 5; Length 556; 

Best Local Similarity 100.0%; Pred. No. 9.2e-212; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; 



lllllllillllllllllllllllllllllllllllllllllllllllllllllllllll 



MM IMIII I IMIMMII III I llll III III I III I Ml III MM llllllllll 



MMMIMMMMMMMMMMMiMMMMMMIMM III IMIIIMM 



MIMIMMIIMMMIIMMMMMMMMIIMIMMIMM IMMIIMI 



Ml MIMMIIIIMM MM IMMIMMMI llll I III III I III III MM III 



J 1 1! 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 11 1 1 M 1 1 j 1 1 M I 1 1 1 ! J 1 1 1 1 1 



MM III III MM MM MM III MM llll II MM MM III I Ml llll IMIII 



lllllllllllllllllllllllllllllllllllllllllllllllllllltlllll 



llllllllllllllll > 1 1 M 1 1 II 1 1 1 1 IIIIIIIIIIIM 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


1B1 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


oy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


oy 


541 


Db 


541 



llllllllllllllll 



RESULT 8 

US-10-473-127-1377 

; Sequence 1377, Application US/10473127 
; Publication No. US2004023609LA1 
; GENERAL INFORMATION: 



APPLICANT: Zycoa Inc. 

TITLE OF INVENTION: TRANSLATIONAL PROFILING 

FILE REFERENCE: 08191 -026WO1 

CURRENT APPLICATION NUMBER : US/ 10/4 73 , 127 

CURRENT FILING DATE: 2003-09-26 

PRIOR APPLICATION NUMBER: 60/279,495 

PRIOR FILING DATE: 2001-03-28 

PRIOR APPLICATION NUMBER: 60/292,544 

PRIOR FILING DATE: 2001-05-21 

PRIOR APPLICATION NUMBER: 60/310,801 

PRIOR PILING DATE: 2001-08-08 

PRIOR APPLICATION NUMBER: 60/326,370 

PRIOR FILING DATE: 2001-10-01 

PRIOR APPLICATION NUMBER: 60/336.780 

PRIOR PILING DATE: 2001-12-04 

PRIOR APPLICATION NUMBER: 60/358,985 

PRIOR PILING DATE: 2002-02-20 

; NUMBER OF SEQ ID NOS: 2041 

; SOFTWARE: PastSBQ for Windows Version 4.0 

; SEQ ID NO 1377 

; LENGTH: 556 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-473-127-1377 



Query Match 100.0%; 
Best Local Similarity 100.0%; 

S56; Conservative 0 



Score 2941; DB 5; 
Pred. No. 9.2e-212; 
Mismatches 0; 



Length 556; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


oy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


oy 


361 


Db 


361 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



tllllllillllllllllllllllllllllllllllllllllllllllllllllllllll 



Mill llll MIMIII III llll III MMIMIMMIIIMM II lllllll llll I 



Mill llll MIMIII III llll III lllllllllll llllllll II lllllll Mill 



llll IIIMIIIIMIMIMIMIMM llllllllllll IMIIIMM llll llll 



II IIIIMI I llll MM 



MMMMMMMMMIIMMIMIIMMMMIMMMMIIMMMIMIM 



oy 
Db 
oy 



421 SNSFELDLQFSEDEKRLLLBKQAGGNPWHQPVENNLI L.KMGFVDKRXCLFARRRQLLLTB 400 

nun 1 111 nun 1 111 111 111 1 ii iiiiiii mi 111 iiiiii 111 in mm i 

431 SNSFBU3LQFSSDSKRUXBKCAOGNPWHQ7VENNLI LW4GPVDKRKGLPARRROLLLTB 480 
481 GPHLYYVDPVNKVlJCGBIPWSQSLJiPKAJaJPCT^ S40 

-Bi qrHLYYVDFVii^ 540 

541 BVHRQRYQSH POAAVQ S56 

II 1 1 1 1 1 M 1 1 II 1 1 1 

541 BWRQRYQSHPDAAVQ 556 



RESULT 9 

US-10-473-137-137B 

Sequence 1378, Application US/10473137 
Publication No. US30040336091A1 
GENERAL INFORMATION: 
APPLICANT: Zycoa Inc. 

TITLE OF INVENTION : TRANSLATIONS PROFILING 
PILE REFERENCE: 0B191-026WO1 
CURRENT APPLICATION NUMBER: US/10/473,137 
CURRENT FILING DATE: 2003-09-36 
PRIOR APPLICATION NUMBER: 60/379,495 
PRIOR FILING DATE: 2001-03-28 
PRIOR APPLICATION NUMBER: 60/393,544 
PRIOR FILING DATE: 3001-05-31 
PRIOR APPLICATION NUMBER: 60/310,801 
PRIOR FILING DATE: 3001-08-08 
PRIOR APPLICATION NUMBER: 60/336,370 
PRIOR FILING DATE: 3001-10-01 
PRIOR APPLICATION NUMBER: 60/336, 7B0 
PRIOR PILING DATE: 3001-12-04 
PRIOR APPLICATION NUMBER: 60/358,985 
PRIOR FILING DATE: 3003-03-30 
NUMBER OF SEO 10 NOS: 3041 
SOFTWARE: FastSEQ for Hindowa Veraion 4.0 
SEO ID NO 1378 
LENGTH: 556 



; ORGANISM: Homo sapiens 
US-10-473-127-1378 



Query Match 100.0%; 
Beat Local Similarity 100.0%; 
Matchea 556; Conservative 0 



Score 2941; DB 5; 
Pred. No. 9.2e-213; 
Mismatches 0; 



Length 556; 



Qy 1 MARTTSQL YDAVP I QSSWLCS CPS PSMVRTQTESSTP PG I PGGSRQGPAMDGTAAEPRP 60 

IIIIIIIMIIMIIIIIIIIIIIIIIIIIItllMIIIIIIIIIIIIIIMIIIIIIII 
Db 1 MARTTSQL YDAVP I QSSWLCS CPS PSMVRTQTESSTPPO I PGGSRQGPAMDGTAAEPRP 60 

Qy 61 GAGS LQHAQ P P PQP RKKRPED FKFG K I LGEGS FSTWLARELATSREYA I K I LBKRH 1 1 K 130 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 GAGSLQHAQPPPQPRKKRPEDFKFGKI LGEGS FSTWLARELATSREYA I KI LEKRH 1 1 K 130 

Qy 121 ENKVPYVTRBRDVMSRLDH P F FVKL Y FTFQDD B KL Y FGLS Y A KNG E LLKY IRKIGSFDBT 180 

m i ii i iii iii i m iimini m i iii iii iii mi iii m iii m i iii i 



Db 


131 


Oy 


181 


Db 


181 


oy 


341 


Db 


241 


Qy 


301 


□b 


301 


Qy 


361 


Db 


361 


oy 


431 


Db 


431 


Oy 


481 


Db 


481 


oy 


541 


Db 


541 



X YPTFQDDEKLYFG LEY AKNQBLLKY IRKIGSFDBT 180 



1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 r 1 1 1 1 L 1 1 1 1 1 1 1 1 1 M I i 1 1 1 1 1 1 1 1 1 M 1 1 1 1 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



mm iii mini iii iii immiii nimim mm in mi inn 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiinii 

JFGCMOVSSSSSSHSLSASDTGLPQRSGSNIEOYIHDLC 430 



PWHQFVBNNL1 LKMGPVDKRKGLFARRRQLLLTE 480 



IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 



miiiiiiimiii 



RESULT 10 

US-10-473-137-1380 

; Sequence 1380, Application US/10473137 
; Publication No. US30040336091A1 
; GENERAL INFORMATION: 
; APPLICANT: Zycoa Inc. 

; TITLE OF INVENTION: TRANSLAT I ONAL PROFILING 

; FILE REFERENCE: 08191 -036WO1 

; CURRENT APPLICATION NUMBER: US/ 10/473 , 137 

; CURRENT FILING DATE ; 3003-09-36 

; PRIOR APPLICATION NUMBER: 60/279.495 

; PRIOR PILING DATE: 2001-03-28 

PRIOR APPLICATION NUMBER: 60/292,544 
PRIOR PILING DATE: 2001-05-31 
PRIOR APPLICATION NUMBER: 60/310,801 
PRIOR FILING DATE: 2001-08-06 
PRIOR APPLICATION NUMBER: 60/336,370 
PRIOR FILING DATE: 3001-10-01 
PRIOR APPLICATION NUMBER: 60/336,780 
PRIOR FILING DATE: 3001-13-04 
PRIOR APPLICATION NUMBER: 60/358,985 
PRIOR FILING DATE: 3003-03-30 
NUMBER OP SSQ ID NOS: 3041 
SOFTWARE: FaatSBO for Hindowa Veraion 4.0 
SEQ ID NO 1380 
LENGTH : SS6 



; TYPE : PRT 

; ORGANISM: Homo sapiens 

US-10-473-137-1380 

Query Match 100.0%; Score 2941; DB 5; Length 556; 

Best Local Similarity 100.0%; Pred. No. 9.3e-213; 

Matches 556; Conservative 0; Mismatches 0; Indole 0; 



Oy 


l 


Db 


l 


oy 


61 


Db 


61 


oy 


121 


Db 


121 


Oy 


181 


Db 


181 


oy 


241 


Db 


241 


oy 


301 


Db 


301 


oy 


361 


Db 


361 


Oy 


421 


Db 


421 


Oy 


481 


Db 


481 


Oy 


541 


Db. 


541 



ii i imimimi III III Ml III I III Ml III 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 M I 



MiiiiMmiiimiMiimimiiimiiiimiiiiiii iiiiiiiiiii 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 



miiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiMiiiiimmiiiiiiii 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I ! 1 1 1 1 1 1 1 1 1 1 F 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



lllllllllllllllilllllllillllllllllllllllllllllllllllllllllll 



llllllllllllllllllllilllllllllllllllllllllllllllllllllllllll 



I Ill I IIIIIII 1111 II 



miiiiiiiiiiiiiiiiiiiiiiiiimiiiiimiimiiiiiiiiiiiiiiii 



iiiiiiiiiiiiiin 



RESULT 11 

US-10-473-127-13B2 

; Sequence 1383. Application US/10473137 
Publication No. US30040336091A1 
GENERAL INFORMATION: 
APPLICANT: Zycoa Inc. 

TITLE OF INVENTION: TRANSLAT! ONAL PROFILING 
FILE REFERENCE: 08191-036WO1 



CURRENT APPLICATION NUMBER: US/10/473,127 
CURRENT FILING DATE: 2003-09-26 
PRIOR APPLICATION NUMBER: 60/279,495 
PRIOR FILING DATE: 2001-03-2B 
PRIOR APPLICATION NUMBER: 60/393,544 
PRIOR FILING DATE: 3001-05-31 
PRIOR APPLICATION NUMBER: 60/310.801 
PRIOR PILING DATE: 3001-08-08 
PRIOR APPLICATION NUMBER: 60/336.370 
PRIOR FILING DATE: 2001-10-01 
PRIOR APPLICATION NUMBER: 60/336, 7 B0 
PRIOR FILING DATE: 3001-12-04 
PRIOR APPLICATION NUMBER: 60/358,985 
PRIOR FILING DATE: 3003-03-30 
NUMBER OP SEQ ID NOS: 3041 
SOFTWARE: PastSEQ for Hindowa Version 4.0 
SEQ ID NO 1383 

LENGTH: 556 

TYPE: PRT 

ORGANISM: Homo aapiena 
5-10-473-137-1382 



Query Match 100.0%; 
Best Local Similarity 100.0%; 
Matches 556; Conservative 0 



Score 3941; DB 5; Length 556; 
Pred. No. 9.3e-313; 

Mismatches 0; Indela 0; 



oy 


1 


Db 


1 


oy 


61 


Db 


61 


oy 


121 


Db 


121 


Oy 


181 


Db 


181 


oy 


241 


Db 


241 


oy 


301 


Db 


301 


Oy 


361 


Db 


361 


oy 


421 


Db 


421 



iiiiii iiiiiiiiiiiiiiiiiiiiiiiMMmiiiimmiiiiiiiiiiiM 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiii 



iiiiii mi nil m in in mi iimmiiiiiimiiiiii mm inn 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiii 



IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIMIIIIIIIIIIIIIIIIIII 



1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 » 1 1 1 1 1 1 J 1 1 1 1 1 r f i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



IIMIIIIIIIMIMIMIMIIIIIIIIIIMIIMIMMIIMIIIMIIIlim 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIII 

„„, , , — .ILKMGPVDI0U5DLFARRRQLLLTB 480 



Qy 481 GPHLYYVDWNKVLKGBIPWSQBUIPEAKNFKTFFVHTPKRTYYLMDPSGNAHKWCSKIO 540 


Db 


161 


1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 t 1 1 1 1 1 i 1 i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 M 1 1 1 M 1 1 1 1 1 






Db 461 GPHLYYVDPVKKVLKGSIF^SQELRPEAKKPKTFFVHTPfreTYYL><DPSGNRHKHCRKIO 540 


Qy 


241 


Qy 541 EWRQRYQSHPDAAVQ 556 


Db 


341 


IIIIIIIIIIIIIIH 






Ob 541 BWRQR YQSH PDAAVQ 556 


Qy 


301 




Db 


301 


RESULT 12 






US-10-473 -127- 1363 


Qy 


361 


; Sequence 1383, Application US/10473137 




; Publication No. US20040236091A1 


Db 


361 


; GENERAL INFORMATION: 






; APPLICANT : Zycos Inc. 


Qy 


421 


; TITLE OP INVENTION: TRANS LAT I ONAL PROFILING 






; FILE REFERENCE: 06191-026WO1 


Db 


421 


; CURRENT APPLICATION NUMBER: US/ 10/4 73. 127 






; CURRENT FILING DATE: 2003-09-26 


Oy 


481 


; PRIOR APPLICATION NUMBER: 60/279,495 






; PRIOR PILING DATE: 2001-03-26 


Db 


481 


; PRIOR APPLICATION NUMBER: 60/292,544 






; PRIOR FILING DATS: 2001-05-21 


Qy 


541 


; PRIOR APPLICATION NUMBER: 60/310,601 






; PRIOR FILING DATE: 2001-08-06 


Db 


541 



PRIOR APPLICATION NUMBER: 60/326,370 
PRIOR FILING DATE: 2001-10-01 
PRIOR APPLICATION NUMBER: 60/336.780 
PRIOR FILING DATE: 2001-12-04 
PRIOR APPLICATION NUMBER: 60/3S8.98S 
PRIOR PILING DATE: 2002-02-20 
NUMBER OF SEQ ID NOS : 2041 
SOFTWARE: FaatSEQ for Windows Version 4.0 
SEO ID NO 1363 
LENGTH: 556 
TYPE: PRT 

ORGANISM: Homo eapiena 
US-10-473-127-1363 

Query Match 100.0%; Score 2941; DB S; Length 5S6; 

Best Local Similarity 100.0%; Pred. No. 9.2e-212; 

556; Conservative 0; Mismatches 0; Indels 0; 



iiiiiiiiiitiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiiiiiiiiiiiiiiiii 



tiiiiiMiiiiiiiniiiitiiiiiiiiiiiiiiiiiiMiiiiiiiiniiiiiiii 



llllllllllltilllNllllllllllllllllllllllllllllllllltllllllll 



181 CTRFYTAB I VSALEYLHGKG I IHRDLKPENI LLNEDMHIQITDFGTAKVLSPBSKQARAN 240 



oy 


i 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 



MM I HI MM I Ml I MM I till MM III Mill III MM II HI I milll I 



1 1 MM Ml MM 1 1 II I Ml 1 1 MM IMIMMIIII III I III MIMI lllllll I 



lllllllllllllllllllllllllllllllllllllllilllllllillllllllllll 



lllllllllllllllllllllllllllilllillltllllllllllltllllllllllll 



J J 1 1 1 1 1 1 1 1 r 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 i 1 1 1 4 1 1 1 J 1 1 J 1 1 1 1 1 



I M 1 1 M 1 1 ) 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



i M 1 1 1 1 1 M 1 1 1 1 1 1 



RESULT 13 

US-10-473-127-1384 

Sequence 1384, Application US/10473137 
Publication No. US20040236091A1 
GENERAL INFORMATION: 
APPLICANT: Zycos Inc. 

TITLE OF INVENTION: TRANS LAT I ONAL PROFILING 
FILE REFERENCE: 08191 -02 6W01 
CURRENT APPLICATION NUMBER: US/10/473 . 127 
CURRENT FILING DATE: 2003-09-26 
PRIOR APPLICATION NUMBER: 60/279.495 
PRIOR FILING DATS: 2001-03-28 
PRIOR APPLICATION NUMBER: 60/292,544 
PRIOR PILING DATS: 2001-05-21 
PRIOR APPLICATION NUMBER: 60/310,801 
PRIOR FILING DATS: 2001-08-06 
PRIOR APPLICATION NUMBER: 60/326,370 
PRIOR FILING DATE: 2001-10-01 
PRIOR APPLICATION NUMBER: 60/336.780 
PRIOR FILING DATE: 2001-12-04 
PRIOR APPLICATION NUMBER: 60/358.965 
PRIOR FILING DATE: 2002-02-20 
NUMBER OF SEO ID NOS: 2041 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 1384 

LENGTH: 556 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-473-127-1384 



Query Match 100.0%; Score 2941; DB 5; Length 556; 

Best Local Similarity 100.0%; Pred. No. 9.2e-212; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


MARTT SQL YDAVP I QS SWLCS CPS PSMVRTQTE SST P PG I PGGSRQG PAMDGTAAB P R P 60 






MMMMMMIMMIMM MMMMMMMIMMMMMMM MM III II 


Db 


1 


MARTTSQLYDAVP I QS SWLCS CPS PSMVRTQTESSTPPGI PGGSRQGPAMDGTAAEPRP 6 0 


Qy 


61 


GAGSLQHAQ PP PQPR KKRPED FKFGK I LGEGS FSTWLARELATSREYA I KI LE KRH 1 1 K 120 






1 MIIIIIIMIIHIIMIIMIIIIIIII liMMIIIIII IIIMIIIMM III II 


Db 


61 


GAGSLQHAQPP PQPR KKRPED FKFGK I LG EGS FSTWLAREUVTSREYA I KI LE KRH 1 1 K 120 


Qy 


121 


ENKVPYVTREROVMS RLDH P F FVKLYFT FQDD E KLYFGLSYAKNGELLKY I RKIGSFDET 180 






iiiHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiii 


Db 


121 


ENKVPYVTREROVMS RLDH P F FVKLYPTFQDD E KLYFGLSYAKNGELLKY I RKI GSFDET 180 


Qy 


181 


CTRFYTAB I VSALEYLHGKG 1 1 HRDLKPEN I LLNEDMH IQ I TDFGTAJCVLSPBSKQARAN 240 






MM MMMMMMMMMMMMIMM MIMMMMMIMMIMIIIMI 


Db 


181 


CTRFYTAB I VSALEYLHGKG I I HRDLKPEN I LLNEDMH IQ I TDFGTAKVLSPESKQARAN 240 


Qy 


241 


S FVGTAQ YVSP ELLTE KSACKSSD LWALGC 1 1 YQLVAGLP P FRAGNB YL I FQKI I KLEYD 300 






MM IMIIIIIIIIMIIMIIIIMI MMIMIMMIMMMMMMMMMI 


Db 


241 


S FVGTAQYVS P ELLTE KSACKS SDLWALGC 1 1 YQLVAGLPP FRAGNEYL I FQK 1 1 KLEYD 300 


Qy 


301 


F PE KFFPKARD LVE KLLVLDATKRLGCE EMEGYG PL KAH PFFESVTWENLHQQTP PKLTA 360 






IMMMIIMIIMMIMIIMIMI IIIIIIIIMMIIIMIIIMMIMMMI 


Db 


301 


FPEKFFPKARDLVEKLLVLDATKRLGCEEMEGYGPLKAHPFFBSVTWENLHQQTPPKLTA 3 60 


Qy 


361 


YLPAMSEDDBDCYGNYDNLI^QFGCMCVSSSSSSHSLSASDTGLPQRSGSNI EQY I HDLD 420 






1 IIIIIMIIIMIIIIMIIIIIMIMMIIIMMIIMMIIMIMIIII III II 


Db 


361 


YL PAMSSDDEDCYGNYDNLLSQFGCMQVS S S SSSH SLSASDTGLPQRSG SN I EQY I HDLD 420 


oy 


421 


SNS FE LDLQFSEDB KRLLLE KQAGGNPWHQFVENNL I LKMGPVD KRKGLFARRRQLLLTS 480 






IMIIIIIMIIMMIIMIMMIIMMIIIIMIIMMIIIIIIMIIIIMI II 


Db 


421 


SNS FELDLQFSED B KRLLLE KQAGGNPWHQFVENNL I LKMGPVD KRKGL FARRRQLLLTE 480 


Qy 


481 


GPHLYYVDPVNKVLKGE I PWSQELRPEAKNFKTFFVHTPNRTYYLMDPSGNAHKWCRKI Q 540 






MMMIMIIIMIIMIMMMMMIMMMIMMMMMIIMIMIMI II 


Db 


481 


GPHLYYVDPVNKVLKGE I PWSQELRPBAKNFKTFFVHTPNRTYYLMDPSGNAHKWCRKI Q 54 0 


Qy 


541 


SVWRORYQSHPDAAVQ 556 






IIIIIIIIIIIIIIH 


Db 


541 


EVWRQRYQSHPDAAVQ 556 


RESULT 14 






US-10-723 


-860 


2053 



; Sequence 2053. Application US/10723660 
; Publication No. US200402S3606A1 



; GENERAL INFORMATION: 

; APPLICANT: Aziz, Natasha 

; APPLICANT: Gins burg, Wendy M. 

; APPLICANT: Zlotnik, Albert 

; TITLE OF INVENTION: Methods of Diagnosis of Soft Tissue Sarcoma, Compositions 
& 

; TITLE OF INVENTION: Methods for Screening for Soft Tissue Sarcoma Modulators 



; FILE REFERENCE: 05B82 . 0193 . NPUS01 

; CURRENT APPLICATION NUMBER: US/10/723 , 660 

; CURRENT FILING DATS: 2003-11-26 

; PRIOR APPLICATION NUMBER: 60/429,739 

; PRIOR FILING DATS: 2002-11-26 

; NUMBER OF SEQ ID NOS: 8393 

; SOFTWARE: Patentln version 3.2 

I SEQ ID NO 2053 

; LENGTH: 556 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-723-660-2053 



Query Match 100.0%; Score 2941; DB 5; Length 556; 

Best Local Similarity 100.0%; Pred. No. 9.2e-212; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


Tmimmnm 


60 


Db 


1 


MARTTSQLYDAVP IQSSWLCSCPSPSMVRTQTESSTPPG I PGGSRQGPAMDGTAAEPRP 


60 


Qy 


61 


GAGSLQHAQP P PQP RKKRP BD FKFG K I LGEGS FSTWLARELATSREYA I KI LEKRH 1 1 K 


120 






MUM IMMMMIMMMMMIMMIMIMMMMMMMMMMMIM 




Db 


61 


GAGSLQHAQPPPQPRKKRPEDFKFGKI LGEGSFSTWLARELATSRBYAI KI LEKRH 1 1 K 


120 


Qy 


121 


BNKVPyVTRERDVMSRLDHPFFVKLYFTFOUDEKLYFGLSYAKNGBLLKY I RKIGSFDET 


180 






IIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIItllllllltlllllllllllllllll 




Db 


121 


ENKVPYVTRE RDVMSRLDHPFFVKLYFTFQDDEKLYFGLSYAKNGELLKY I RKIGSFDET 


180 


Qy 


181 


CTRFYTAB I VSALEYLHGKG I IHRDLKPENI LLNEDMHIQITDFGTAKVLSPBSKQARAN 


240 






t iiituiii ii miiiiuiiii 




Db 


181 


CTRFYTAB I VSALEYLHGKG I IHRDLKPENI LLNEDMHIQITDFGTAKVLSPBSKQARAN 


240 


Qy 


241 


S FVGTAQYVS PELLTE KSACKSSD LWALGCI I YQLVAGLPPFRAGNSYLI FQKI I KLEYD 


300 






lllllllllllllllllllllilllllllllllllllllllllllllllllillllllll 




Db 


241 


SFVGTAOYVSPELLTEKSACKSSDLWALGCI I YQLVAGLPPFRAGNEYLI FQKI I KLEYD 


300 


Qy 


301 


FPE KFFPKARDLVE KLLVLDATKRLGCEEMEGYG P L KAH PFFESVTWENLHQQTP PKLTA 


360 






llllllllllllllllllillllllllllllllllllllllllllllllllllllllill 




Db 


301 


FPE KFFPKARDLVE KLLVLDATKRLGCEEMEGYG P L KAH PFFESVTWENLHQQTP PKLTA 


360 


Qy 


361 


YLPAMSBEDEDCYGNYDNLI^QFGCMQVSSSSSSHSLSASDTGLPQRSGSNI BQYIHDLD 


420 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllilll 




Db 


361 


YLPAMSEDDBDCYGNYDNLI^FGOIQVSSSSSSHSLSASDTGLPQRSGSNI EQYI HDLD 


420 


Qy 


421 


SNSFBLDLQFSBDEKRLLLBKQAGGNPWHQPVENNLILKMGPVDKRKaLFAR 


480 






IMIIIIMMMMMIIIIIIIMIIIMIIIIIIIIIIIMIIMIIIIIMIIII 




Db 


421 


STteFBLOLQFSEDBKRLLLBKCAGGNPWHQPVBNNL I LKMGPVD KRKGL FARRRQLLLTE 


4B0 


Qy 


481 


GPHLYYVDPVNKVLKGE I PWSQBLRPBAKNFKTP FVHTPNRTYYLMD PSGNAHKHCRKXQ 


540 






INN Illllllllllllllllllllll IIMMMMMIMIMM 




Db 


481 


aPHLYYVDPVNKVLKDB I PWSQEIAPBAKNFCTFFVHTPNRTYYLMDPSaNAHKWCRKIQ 


540 


Qy 


541 


EVWRQRYQSHPDAAVQ 556 






llllllllllllllll 




Db 


541 


EVWRQRYQSHPDAAVQ 556 





Katchea 556 



US- 10 --689 -576-1 


oy 


1 


. sequence i. Application US/ 10689576 




, ruuiicaLion no. uo4Uusuuj4AD5A1 


Db 


1 








, Ai'KUiuiBi : AlUll, uano K 


oy 


1 


» TITLB OP INVENTION ; ENZYME 












moBCCDrtuncurc & nno tec . 
j LuKKba KUWUt.fJL.JJ MJUKSS9 : 






; AUuKttdaoa ; usecjcie riexacnmann « nu^Qi f m*r 


oy 


121 


; STREET: 39 State Street 






; CITY: Rocheator 


Db 




j nib. NOW TOTK 








oy 










COMPUTER READABLE PORK : 


Db 




, nouiun lira . r loppy aiiiv 






; COMPUTER: IBM PC cotm>atible 


oy 


241 








. SOFTWARE: Patent In Releaae 41.0, Vera ion #1.30 


Db 




CURRENT APPLICATION DATA: 






• iDor Ti*»Tinw uinrnCD. nc/m/mo cit 
, ArrLlLAHUN NUnluK. Uo/ iv/ DOJ, 3/e 


oy 


301 


• PILING DATE' 2 o -Oct - 2 0 0 3 








Db 


301 


; PRIOR APPLICATION DATA: 






J APPLICATION NUMBER: US/ 08/ 94 3 i 667 


oy 




I FILING DATE: 0 3 -OCT - 1997 






i At'funj*! lufl nunobK : Kto 7/U94Q4.1 


Db 


361 








j APPLICATION NUMBER: GB 9713826.8 


oy 


421 


i FILING DATE: 19-JUN-1997 






i APPLICATION NUMBER: GB 9717253.0 


Db 


421 


i PILING DATE: 15-AUG-1997 






; ATTORNEY/AGENT INFORMATION: 


Oy 


4B1 


; NAME: Bremen, Susan J 




i REGISTRATION NUMBER: 34,103 


Db 


481 


; REFERENCE/ DOCKET NUMBER: 87792. 97R421 






; TELECOMMUNICATION INFORMATION: 


oy 


541 


i TELEPHONB: 716-362-3640 




; TELEFAX: 716-262-4133 


Ob 


541 



INFORMATION FOR SEO ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 556 amino acida 
TYPE: amino acid 
STRAND EDNESS : eingle 
TOPOLOGY: linear 
MOLECULE TYPB: protein 
HYPOTHETICAL: NO 
ANTI -SENSE: NO 
ORIGINAL SOURCE: 

ORGANISM: Homo eapiena 
SEQUENCE DESCRIPTION: SEO ID NO: 1: 
US-10-689-576-1 



Conservative 



0; Mismatches 0; Indole 0; Gaps 



KARTTS0LYDAVPIQSSVVU^CTSPSMWTOTESSTProiPC<^ 60 
lllllllllllllllltllllllllllllltlllillltlllllllllllllllllllll 
MARTTSQLYDAVPIOSSVVIX^CPSPSMVRTOTESSTPWIPOGSRQaPAMIXTrAAEPRP 60 

QAGSlXSiAQPPF'QPRKKRPEDrKFGKI L^IEGSFST\A/1ARBLATSREYAIKILSKRHI I K 120 
lllllllllllllllllllllllllllllllllllllllllllllltlllllllllllll 
GAGSLQHAQ P P PQ PR KKRPED PKFG KI LGEG S FSTWLARELATSRBYA I KI LBKRH I IK 120 

ENKVPYVTRERDVKSRLDH PFFVKL Y PTFQODB KLYFGLSYAKNGSLLKY I RKI GS FDBT 180 
lllllllllllllllllltllllllltlilltllllllllllllllllilllllilllll 
BNKVPYVTRERDVMS RLDH PF PVKL YFTFQDDE KLYFGLSYAXNGELLKY IRKIGSPDBT 180 

CTRFYTASIVSALBYLHGKOIIHRDLKPENILLNEDMHIQITDFGTAK^ 240 

lllllllllllllllllllllllllllltlllllltllltllllllllllllllllllll 
CTRPYTAEIVSALEYLHGKDI IHRDLKPENI LIjNEDMH IQITDFGTAKVLSPESKQARAN 240 

SPVGTAQVVSPBl^TBKSACKSSDLWALOCI I YQLVAGLP PFRAGNEYL I FQKI I KLBYD 300 
llllilllllllllllllllllllllllllllllllllllllllllllllllllllllll 
SFVGTAQYVSPELLTBKSACKS5DLKALGCI I YOLVAGLPPFRAGNEYLI FQKI I KLBYD 300 

FPE KFFPKARDLVB KLLVLDATKRLGCSEMEGYG PLKAH PFFBSVTVfENLHQQTPP KLTA 360 

lllllllllllllllllllllllllllllllllllllll Ml Mil llllllllllllll 

FPBKFFPlCARDLVBKIXVUlATKRUXSEMEGYGPLKAHPFFESVrWE^ 360 

YLPAMSSDDSDCYGNYDNI4LSQFGCMOVSSSSSSHSLSASDTGLPQRSGSNI SOY I HDLD 420 
llllllllllflltllllllllllllllillllllltlllllllllllllllllllllll 
YLPAMSEDDEDCTGNYDNLLSQFTKMOySSSSSSJISLSASDTGLPQRSGSNI E0Y I KDIX) 420 

SNSPELDLQFSEDEKRU^KQAGGNPWHQFVENNLILKMGPW 460 

I Ml I II I III III IIIMM I III MINIM II I IIIIIIMII III I III III Mil 

SNSFBUILQFSEDBKRLIJ^KQAGGNPwHQFv^NNLILKMGPTO 480 

G PHL YYVD PVNKVL KG B I PWSQBLR P BAKNFKT FPVHTPNRTYYLMD PSGNAH KWCRKI Q S40 
I I I I I 1 I I I I I I I I I I I I I I I J t I I I I I I I I I I I I I I I I I t I I I I I I I 1 I I I I I I I I I I I 
GPHLYYVDPVNKVLKGBIPWSQEIJU'EAIO^KTFFVHTPNRTYY 540 



I 1 • I I I I I I I I I I I • 



Search completed: June 37, 2006, 01:51:45 
Job time : 1B0 aeca 



Query Match 100.0%; Score 2941; DB 5; Length S56; 

Beet Local Similarity 100.0%; Pred. No. 9.2e-212; 



ON prot«in 
Run on: 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



protein search, using aw model 

June 27, 2006, 01:36:37 ; Search time 51 Seconds 
(without alignment*) 
954.255 Million cell updatea/aec 



Title: 

Perfect score: 
Sequence : 



US-10-689-S76-1 
2941 

1 MARTTSQLYDAVPIQSSWL R Kl QSVWRQR YQSH PDAAVQ 556 



Scoring table: 8LOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 650591 seqs, 6753062B residues 

Total number of hits satisfying chosen parametei 

Hinimim OB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Miniimim Match 0* 

Maximum Match 100% 
Listing first 45 sunnaries 



1: 



Issued Patents AA : • 

/EMC CelerrI SIDS3/ptodata/2/iea/S_COKB.pep: * 
/EMC~Celerra~SIDS3/ptodata/2/iaa/6~COMB . pep : * 
/ BMC_Co 1 err«_S I DS3 / ptodata/ 2 / i aa/ 7_COMB . pep : * 
/EMC_Celerra_SIDS3/ptodata/2/iaa/H_C0KB . pep : • 
/ EMC~Celerra~S IDS3/ptodata/2/ iaa/PCTUS_COMB . pep : • 
/ BMC ~Ce 1 e rra_S I DS3 /ptoda ta/ 2 / i aa/RB_COMB . pep : • 
/EMC_Celerra SIDS3/ptodata/2/iaa/backf ileal .pep: • 



No. is the number of results predicted by chance to have a 
greater than or equal Co the score of Che result being printed, 
is derived by analysis of the total score distribution. 



11 


614 


20. 9 


113 


3 


US-09-20S-65B-173 


12 


573 


19.5 


104 


2 


US -09-205-658-202 


13 


571 . 5 


19.4 


420 


2 


US -09- 94 9-016-1X701 


14 


562 


19.1 


431 


2 


US-09-031-295-2 


15 


562 


19.1 


431 


2 


US-10-000-039-2 


16 


561 


19. t 


407 


2 


US-10-067-977-4 


17 


561 


19.1 


445 


2 


US-10-067-977-2 


18 


561 


19. 1 


526 


2 


US- 10 -0 94 -74 9- 1861 


19 


560.5 


19.1 


2S6 


2 


US-09-964-956-72 


30 


560. S 


19.1 


358 


2 


US- 09-94 9-0 16- 65B2 


21 


560. 5 


19.1 


358 


2 


US-10-427-923-4 


22 


S56 


18 . 9 


398 


2 


US- 0 9-53 8 -092 -501 


23 


556 


18. 9 


398 


2 


US-0 9-4 87-55BB-404 


24 


555 


18 . 9 


430 


1 


US-06-712-709-9 


2S 


555 


18.9 


430 


2 


US-09-111-444-9 


26 


555 


18.9 


430 


2 


US-09-541-228-9 


27 


554 


18.6 


3 B0 


2 


US-09-487-55BB-402 


28 


553 


18.8 


431 


1 


US-08-712-709-5 


29 


553 


18. B 


431 


2 


US-09-111-444-5 


30 


553 


18. B 


431 


2 


US-09-541-228-5 


31 


550.5 


18.7 


350 


2 


US -09-538-0 92 -991 


32 


S50.5 


18 .7 


359 


2 


US -09 -94 9-016-7862 


33 


543 


18 . 5 


343 


2 


US -09-3 94 -455- IS 


34 


54 2 


18 .4 


336 


2 


US -09-3 94 -455-2 


35 


542 


18.4 


343 


2 


US -09 -3 94 -4 55 -34 


36 


542 


18 .4 


350 


2 


US -09 -S3 8 -092 -946 


37 


542 


18.4 


351 


2 


US-09-394-455-4 


38 


542 


18.4 


351 


2 


US-09-771-161A-194 


39 


540 


18.4 


343 


2 


US-09-394-455-3Q 


40 


540 


18.4 


595 


2 


US-09-417-197-69 


41 


539. S 


18.3 


S41 


2 


US-09-205-658-1S4 


42 


538 


18.3 


350 


2 


US-09-457-040B-37 


43 


538 


18.3 


351 


2 


US-09-457-040B-6 


44 


537 


18.3 


260 


1 


US-07-857-224B-4 


45 


535.5 


18.2 


637 


2 


US-09-817-310-2 



ALIGNMENTS 



RESULT 1 
US-09-016-000-4 











SUMMARIES 




; Sequence 4, Application US/09016000 




% 










; Patent No. 5962232 




Query 










; GENERAL INFORMATION: 


Score 


Match Length DB 


ID 


Description 


; APPLICANT: Hillman, Jennifer L. 














; APPLICANT: Lai, Preeti 


2941 


100.0 


556 


1 


US-09-016-000-4 


Sequence 4, Appli 


; APPLICANT: Bandman, Olga 


2941 


100.0 


556 


2 


US-08-943-667-1 


Sequence 1. Appli 


; APPLICANT: Akerblom, Ingrid E. 


2941 


100.0 


556 


2 


US-09-771-161A-245 


Sequence 245, App 


; APPLICANT: Shah, Purvi 


2937 


99.9 


5S6 


2 


US-09-156-793D-2 


Sequence 2, Appli 


; APPLICANT: Corley, Neil C. 


1227 


41.7 


238 


2 


US-09-771-161A-1S4 


Sequence 154, App 


; APPLICANT: Guegler, Karl O. 


1144.5 


38.9 


752 


2 


US-08-943-667-29 


Sequence 29, Appl 


1 TITLE OP INVENTION: PROTEIN KINASE MOLECUI 


940 


32.0 


636 


2 


US-09-205-658-160 


Sequence 160, App 


; NUMBER OP SEQUENCES: 12 


935 


31.8 


632 


2 


US-09-20S-658-159 


Sequence 159, App 


; CORRESPONDENCE ADDRESS: 


713 


24.2 


766 


2 


US-09-538-092-216 


Sequence 216, App 


; ADDRESSEE: Incyte Pharmaceuticals. Inc. 


655.5 


22.3 


901 


2 


US-09-248-796A-2049S 


Sequence 20495, A 


; STREET: 3174 Porter Drive 



Sequenct 
Sequenct 
Sequenct 
Sequenct 
Sequenct 
Sequenct 



Sequenct 
Sequenei 
Sequenci 
Sequenci 
Sequenc< 
Sequeno 
Sequenci 
Sequenc* 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 



Sequenci 
Sequenci 
Sequenci 
Sequenci 
Sequenci 



173, App 
202, App 
11701, A 

3. Appli 
2, Appli 

4, Appli 
2. Appli 
1861. Ap 
7*, Appl 
6582, Ap 

4, Appli 
S01, App 
404, App 
9, Appli 
9. Appli 
9, Appli 
402, App 

5, Appli 
5, Appli 

5, Appli 
991, App 
7662, Ap 
15, Appl 
2, Appli 
34, Appl 
946, App 
4, Appli 
194, App 
38. Appl 
69, Appl 
154. App 
37, Appl 

6, Appli 
4, Appli 
2, Appli 



CITY: Palo Alto 
STATE : CA 
COUNTRY: USA 
ZIP: 94304 
COMPUTER RSADABLB FORM: 
MEDIUM TYPE : Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: PastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/016.000 
PILING DATE: HEREWITH 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
PILING DATE : 
ATTORNEY/ AGENT INFORMATION: 
NAME: Billings, Lucy J 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PP-0465 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 650-B55-055S 
TELEFAX: 650-845-4166 
TELEX: 

INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 556 amino acids 
TYPE: amino acid 
STRAND ED NESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE : 

LIBRARY: MMLR1DT01 
CLONE: 472480 
US-09-016-000-4 



Query Match 100.0%; 
Best Local Similarity 100.0%; 
Matches 556; Conservative 0 



Score 2941; DB 1; Length 556; 
Pred. No. 1.4e-247; 

Miematches 0 ; Indels 0 ; 



Qy 1 MART/TSQLYDAVPIQSSVVLCSCPSPSMVRTQTESSTPPGI PGGSRQGPAMDGTAABPRP 60 

iiiimiiiiiiiiiiiiiiiiiimiimiii iiiifiiiiiiiimm 

Db 1 MARTTSQLYDAVPIQSSVVliCSCPSPSMVRTQTESSTPPGI POOSRQGPAMDOTAABPRP 60 

Oy 61 GAGSLQHAQP P POPRKKRPED FKFG KI LGEGS FSTVVLARELATSREYA I KI LE KRH 1 1 K 120 

llllllllltlllllllllllllllllllllllllllltlllllllllllllllllllll 
Db 61 GAGSLQHAQ P P PQ P RKKRPED FKFG KI LGEGS FSTWLARELATSRSYA I KI LE KRH 1 1 K 130 

Qy 121 ENKVPYVTRBRDVMSRLDHPFFVTa^YFTFCflDEKLYFGLSTAKNGEIXICYIRKIGSFDCT 180 

lllllllllllltllllllllllllllllllllllllillllllllllllllllllllll 
Db 121 ENKVPYVTRERDVMSRLDH P PPVKLYPTFQDDE KLYFGLSYAKNGELLKY I RKIGSFDST 180 



161 CTRFYTASIVSALBYLHGKGI IHRDLKPENILLNBDMHIQITDFGTAKVLSPESKQARAN 240 

III Mllllll llllll I llll [Hill I Illlll I III I IN III I IMIIM Nil II 

181 CTRFYTABIVSALEYLHGKGI IHRDLKPENILt^NED*WIQITDFOTAXVLSPESKGARAN 240 
241 SFVGTAOYVSPSLLTEKSACKSSDLHALGCI I YQLVAGLPPFRAGNEYLI FQKI I KLBYD 300 

lllllllllllillMlllllilllllllllllllltllllillllllllllllllllll 



Db 

Qy 



Oy 
Db 

oy 

Db 

Qy 

Db 



241 S FVGTAQYVS PE LLTE KSACKS SDLWALG C 1 1 Y0LVAGLP P PRAGNEYL I FQK 1 1 KLEYD 300 

301 FPBKFFPKARDLVEKLLVLDATKRLGCEEMEGYGPLKAHPFFESVTWEN^ 360 

Itllllllllllllllllllllllltllllllllltlllllllllllllllllllllltl 
301 FPB KF F PKARDLVE KLLVLDAT KRLGCE EMEGYGPLKAH P F FESVTWENLHQQTP PKLTA 360 

361 YL PAMSEDDEDCYGNYDNLLSQFGQ40VSS SSSSHSLSASDTGLPQRSGSN I EQY I HOLD 420 

llllllllllllltlMlllllllllllllllllllllllllllllllllllilllllll 
361 YLPAMSEDDEDCYGNYDNLLSQPGOtOVSSSSSSHSLSASDTGLPQRSGSNI EQY I HDLD 4 20 



421 SNSFELDLQFSEDS KRLLLEKQAGGNFWHQFVENNL I L KMC PVDKRKGLFARRRQLLLTE 480 

IIIMIMMIIIIII Mllllll Mllllllllllllllll IIMIIIIIIIIIIIIM 

421 SNSFELDLQFSEDB KRLLLB KQAOGNPWHQFVENNL I LKMG PVDKRKGLFARRRQLLLTE 480 

4B1 GPHLYYVDPVNKVLKGB I PWSQELRPSAKNFKTPFVHTPNRTYYLMDPSGNAH KWCRKI Q 540 

lllllllltillllllllilllllllllililllilllllllllllllllllllllllll 
481 GPHLYYVDPVNKVLKGB I PWSQELRPBAKNFKTPFVHTPNRTYYLMDPSGNAHKWCRKI Q 540 

541 BVWRQR YQSH PDAAVQ 556 

llllllllllllllll 
541 SVWRQRYQSH PDAAVQ 556 



RESULT 2 
US-08-943-667-1 

Sequence 1, Application US/08943667 
Patent No. 6734001 
GENERAL INFORMATION: 

APPLICANT: Alessi, Dario R 
TITLE OP INVENTION: ENZYME 
NUMBER OP SEQUENCES : 35 



Jaeckle Pleischmann b Mugel, LLP 
STREET: 39 State Street 
CITY: Rochester 
STATE: New York 
COUNTRY: USA 
ZIP: 14614-1310 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC- DOS/MS -DOS 
SOFTWARE: Patentln Release ll.O. Version 11.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/943,667 
PILING DATE: 03-OCT-1997 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: GB 9705462.1 
FILING DATE : 17-MAR-1997 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: GB 9712626.6 
FILING DATE: 19-JUN-1997 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: GB 9717253.0 
FILING DATE: 15 -AUG- 1997 
ATTORNEY/ AGENT INFORMATION: 



» 



HAMB: Bremen, Susan J 

REGISTRATION NUMBER: 34,103 

REFERENCE/DOCKET NUMBER: 87792. 97R421 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 716-363-3640 

TELEFAX: 716-263-4133 
INFORMATION TOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 556 amino acid* 

TYPE: amino acid 

STRANDSONSSS : aingla 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
HYPOTHETICAL: NO 
ANTI -SENSE: NO 
ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 
-08-943-667-1 



Quary Match 100.0%; Scora 2941; DB 2; Length 556; 

Best Local Similarity 100.0%; Pred. No. 1.4e-347; 

Matches 556; Conservative 0; Mismatches 0; Indals 0; Gaps 0; 

Qy 1 MARTTSQLYDAVPIQSSVV1X3CPSPSMVRTQTESSTPPGIPGGSROGPAMDGTAAEPHP 60 

lllllllllllllllllllllllillllllllllllllllllllllllllllllllllll 
0b 1 MARTTSQLYDAVPIQSSVVLCSCPSPSMVRTQTESSTPPGI PGGSROGPAMDOTAAEPRP 60 

Oy 61 GAGSLQHAQPPPQPRKKRPEDFKFOK1 LGEGSFSTVVLARBLATSRBYA I KI LEKRH 1 1 K 120 

61 GAGS LQHAQPpLIjPR JO^ KI LEKRH 1 1 K 13 

Oy 121 ENKVP YVTRERDVMS RLDH P PFVKLYFTFQDDB KL YFG LSYAKNGELLKY I R K I GSFDET 180 

llllllllllllllllllllllllllllltillllllllltllllllllllllllillll 
Db 121 BNKVPYVTRERDVMSRLDH P FFVKL YFTFQDDE KLY FGLSYAKNGELL KY I RK I GS FOBT 180 

Oy 181 CTRFYTAEIVSALEYLHGKGI IHRDLKPENI LLNEDMH IQITDFGTAKVLSPESKOARAN 240 

Illllllllilllllil II Illlllllllllllllllll 

Db 181 CTRFYTAEIVSALSYLHGKOIIHROLKPENILLNEDMHIOITDFGTAKVLSPBSKOARAN 240 

Oy 241 SFVGTA0YVSPBLLTEK5ACKSSDLNALGCI IYQLVAGLPPFRAGNEYLIFQKI IKLBYD 300 

llllllllllllltlllllllllllllllllllllllllMIIIIIIIIIIIIIIIIMI 
Db 241 SFVGTAQYVSPBLLTEKSACKSSDLWALGCI IYQLVAGLPPFRAGNEYLIFQKI IKLBYD 300 

Qy 301 FPB KF FPKARD LVE KLLVLDAT KRLGCB EMEGYG PLKAH P FFBSVTWENLKQOTPP KLTA 360 

llillllllllllllllllllllllllllllillllllf Mill llllllll I III I Ml 
Db 301 FPB KFFP KARD LVE KLLVLDAT KRLGCE EMEGYG PLKAH P FFBSVTWENLHQQTP PKLTA 360 

Qy 361 YLPAMSEDDEDCYGNYDNLLSQ FGCMQVSSS SSSH SLSA5DTGLPQRSGSN I EQY I KDLD 420 

M I I M 1 1 M 1 1 1 M 1 1 1 1 1 1 1 I 1 1 1 1 II 

Db 361 YLPAMSEDDEDCYGNYDNLLSQPGCMQVSSSSSSHSLSASDTGLPQRSGSNIBQYIHDLD 420 

421 SNSFBLDLOFSEDEKRLL^ 480 

Db 421 SNS FELDLQFSEDB KRLLLB KQAGGNPWHQFVENNL I LKMGPVDKRKGLFARRRQLLLTE 480 

Qy 481 G PH LYYVDPVNKVL KGE I PWSQE LRPEAKNF KTF FVHTP NRTYYLMDPSGNAH KWCR K I Q 540 

llllllllllllllllllllllllllimilllllllllllllllllllllllilllll 



541 BVWRQRYQSHPDAAVQ 556 

llllilllllllllll 
541 BVWRQRYQSHPDAAVQ 556 



RESULT 3 

US-09-771-161A-245 

; Sequence 245. Application US/09771161A 
Patent No. 6936450 
GENERAL INFORMATION: 
APPLICANT: LEVINS, at al . 

TITLE OF INVENTION: VARIANTS Of PROTEIN KINASES 

FILE REFERENCE: 802620-200S. 1 

CURRENT APPLICATION NUMBER: US/09/771, 16 1A 

CURRENT PILING DATS: 2001-01-36 

PRIOR APPLICATION NUMBER: 09/724,676 

PRIOR FILING DATE: 2000-11-28 

PRIOR APPLICATION NUMBER: 136776 

PRIOR FILING DATS: 2000-06-15 

PRIOR APPLICATION NUMBER: 135619 

PRIOR FILING DATE: 2000-04-13 

NUMBER OF SEQ ID NOS: 273 

SOFTWARE: Patent In version 3.0 
SEQ ID NO 345 
; LENGTH: 556 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-771-161A-245 



Length S56; 



Query Match 100.01; Score 2941; 

Best Local Similarity 100.0%; Pred. No. 1.4*- 247; 
Matches 556; Conservative 0; Mismatches 0; Indels 



0; Gaps 



Qy 1 MARTTSQL YDAVP I QS SWLCSCPSPSMVRTQTESSTPPG I PGGSRQGPAMDGTAAEPRP 60 

II1IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII1IIIIIIIIIIIIIIIIIII 
Db 1 MARTTSQL YDAVP I QS SWLCSCPSPSMVRTQTESSTPPG I PGGSROGPAMDOTAAEPRP 60 

Qy 61 GAGSIXJHAQPPPQPRKKRPEOFKFGKILGEGSFST^AnJ^LATSREYAIKILEKRHIIK 120 

IIMII Mill M I III I III I llllll I III I MM II IIIIIIMMMIII MM II 

Db 61 GAGSLQHAQP P PQ PR KKR PED FKFG KI LGBGS FSTWLARELATSRBYA I KI LE KRH 1 1 K 120 

Qy 121 BNKVPYVTRERDVMSRLDH PFFVKLYFT FQDDE KLYFGLSYAKNGBLLKY I RKI GS FDET 180 

Mill IIIIIIIIIIIMM llllllllll MIMIIIIIIIIIIIMIIMIIIIIIII 

Db 121 BNKVPYVTRERDVMSRLDH P FFVKLYFTFQDDB KLYFGLSYAKNGBLLKY I RKIGS FDET 180 



Qy 1B1 CTRFYTAEIVSALEYLHGKGI IKRDLKPENILLNEDMHIQITDFGTAKVLSPBSKQARAN 240 

MIMIIMIIIMMIMMMIIMIMMIMIMIIIMIIMIMMIIIMIII 

Ob 181 CTRFYTASIVSALBYLHGKGIIHRDLKPENILLNSDMHIQITDFGTAKVLSPBSKQARAN 240 

241 ^^^^^^^^|^^^^^^j > ^ , | 1 ^^^| ly^^^j'^^^^^l'l ^jfj | M*MM 
Db 241 SFVGTAQYVSPBLLTEKSACKSSDLVlALaCI IYQLVAGLPPFRAGNEYLIFQKI I KLBYD 300 

Qy 301 FP E KFF PKARD LVE KLLVLDAT KR LG CE EMEGYG P LKAH PFFESVTWBNLH QQTP PKLTA 360 

MIMIIIIIIMIIMII llllll IIIIIIIIIIIIIIIIIIIIIIIIIMI 



Db 301 FPE KFF P KARD LVE KLLVLOATKRLGCEEMEGYGPLKAH P FFESVTWENLHQQTP PKLTA 360 

Qy 361 YLPAMSEDDEDCYGNYDNLLSQFGCMQVSSSSSSHSLSASDTGLPQRSGSNI EQY I HDLD 420 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 r M II I M 1 1 1 r 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 M 

Db 361 YLPAMSEDDEDCYGNYDNLLSQFGCMQVSSSSSSHSLSASOTGLPQRSGSNI EQY I HDLD 4 20 

Qy 421 SNSFELDLQFSEDEKRLLLEKQAGGNPWHQFVENNLI LKMGPVDKRKGLFARRRQLLLTE 480 

IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 421 SNS FELDLQFSED SKRLLLE KOAGGNPWHQ FVENNL I L KMG PVDKRKGLFARRRQLLLTS 480 

Qy 481 GPHLYYVDPVNKVLKGE I PWSQBLRPEAKNFKTFFVHTPNRTYYLMDPSGNAHKWCRKI Q 540 

lllllllllllllllllllllllllllllllllllllllllllllllillllllllllll 
Db 481 GPHLYYVDPVNKVLKCB I PWSQELRPEAKNFKTFFVHTPNRTYYLMDPSGNAHKHCRKI Q 540 

Qy 541 EVWRQR YQSH PDAAVQ SS6 

llllilllllllllll 
Db 541 EVWRQR YQSH PDAAVQ 556 



RESULT 4 

US-09-156-793D-2 

Sequence 3, Application US/09156793D 
Patent No. 6683920 
GENERAL INFORMATION: 

APPLICANT: Stephens, Len 

APPLICANT: Hawkins, Philip T. 

APPLICANT: Stokoe, David 

TITLE OF INVENTION: Compositions and Methods for Identifying PKB Kinase 
TITLE OF INVENTION: Inhibitors 
FILE REFERENCE: 1030-US 

CURRENT APPLICATION NUMBER: US/09/156 . 793D 
CURRENT FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/060.190 
PRIOR FILING OATS: 1997-09-36 
NUMBER OF SEQ ID NOS: 3 
SOFTWARE: Patentln Ver. 3.0 
SEQ ID NO 2 
LENGTH : 556 
TYPE: PRT 
; ORGANISM : PKB Kinase 
US-09-156-793D-2 



Query Match 99.9%; 
Best Local Similarity 99.8%; 
Matches 555; Conservative 



Oy 


1 


Ob 


1 


oy 


61 


Db 


61 


Qy 


121 


Db 


121 



Score 3937; DB 2; Length 556; 
Pred. No. 3.2e-347; 
.; Mismatches 0; Indels 0; 
r { 



MMMMMMMMMMMMMMIMMMMMMIMMIMIIMMIMM 



lllllllllllllllllllllllllllllllllillllllllllllllllllllllllll 



IIIIIIIIIIIIIIMIIIIIIIIIMMIIIIIIIIIMIMIIIIIIMIIIIIIIII 



Qy 181 CTRFYTAEIVSALEYLHGKGI IHRDLKPBNILLNEDMHIQITDFGTAKVLSPBSKQARAN 240 

llllllllilllilllllllllllllllllllllllllllllllllllllllllllllll 
Db 181 CTRFYTABIVSALSYLHGKGI IHRDLKPENILLNEDMHIQITDFGTAKVLSPBSKQARAN 240 

Qy 241 SFVGTAQYVS PBLLTE KSACKS SDLWALGC 1 1 YQLVAGL P P FRAGNBYL I FQKI I KLEYD 300 

Mill IMIIMMIMMMMIIMIIM MIMMIMIMIIMIIMIMM 

Db 241 SFVGTAOYVSPBLLTBKSACKSSDLWALGCI I YQLVAGLPPFRAGNEYLI FQKI I KLBYD 300 

Qy 301 FPB KFFP KARD LVE KLLVLDATKRLGCEEMEGYGPL KAH PFFESVTWENLHQOTP P KLTA 360 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMlMllllllll 

Db 301 FPEKFFPKARDLVE KLLVLDATKRLGCEEMEGYG PLKAH PFFESVTWENLHQQTPP KLTA 360 

Qy 361 YLPAMSEDDEDCYGNYDNLLSQFGCMOVSS SSSSHS LSASDTGLPQRSGSN I EQY I HDLD 420 

361 YLPAMSEDDED^^ 420 

Qy 421 SNSFELDLQPSBDBKRLLLEKQAGGNPWHQPVENNLI LKMGPVDKRKGLFARRRQLLLTE 480 

lllllllllllllllllllllllllllllllllllltllllllllllltlllllllllll 
Db 421 SNSFELDLOFSEDEKRLLLEKQAGGNPWHQFVENNLI LKMGPVDKRKGLFARRRQLLLTE 460 

Qy 481 GPHLYYVDPVNKVLKGE I PWSQELRPSAKNFKTPFVHTPNRTYYLMDPSGNAHKWCRKI Q 540 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIItllllllllllllMI 
Db 481 GPHLYYVDPVNKVLKGE I PWSQELRPEAKNFKTFFVHTPNRTYYLMDPSGNAHKHCRKI Q 540 

Qy 541 EVWRQR YQSH PDAAVQ 556 

llllilllllllllll 
Db 541 EVWRQRYQSH PDAAVQ 556 



RESULT 5 

US-09-771-161A-1S4 

; Sequence 154, Application US/0977116 la 
; Patent No. 6936450 
GENERAL INFORMATION: 
APPLICANT: LEVINS, et al. 

TITLE OF INVENTION: VARIANTS Of PROTEIN KINASES 

FILE REFERENCE : 802620-2005.1 

CURRENT APPLICATION NUMBER: US/09/771 , 161A 

CURRENT FILING DATE: 2001-01-26 
; PRIOR APPLICATION NUMBER: 09/724,676 
; PRIOR FILING DATE: 2000-11-28 
; PRIOR APPLICATION NUMBER: 136776 

PRIOR FILING DATE : 2000-06-15 
; PRIOR APPLICATION NUMBER: 135619 
; PRIOR FILING DATE: 2000-04-13 
; NUMBER OF SEQ ID NOS: 273 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 154 
; LENGTH : 238 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-771-161A-154 

Query Match 41.7%; Score 1337; DB 2; Length 238; 

Beat Local Similarity 100.0%; Pred. No. 9.6e-99; 

Matchea 236; Conservative 0; Mismatches 0; Indels 0; 



4 




oy 


1 


Db 


1 


oy 


61 


Db 


61 


oy 


131 


Db 


121 


oy 


181 


Db 


iei 



lllllllllilllllltllllllilllllllllilllllllllllllllllllllltlll 



lllilltllllllllllltlllllllltllllllllllltlllltlllllllllllllli 

[IK 120 



M III I llllillll Mill IMIM I Ml llll III llllll I Hill! I III III I II 



II MM lllllllllllll I IIMIM III III Ml MM MM llllllllll II 



RESULT 6 

US-06-943-667-29 

; Sequence 29, Application US/0fl943667 

; Patent No. 6734001 

; GENERAL INFORMATION: 

; APPLICANT: Alessi, Derio R 

; TITLE OF INVENTION: ENZYME 



Jaeckle Pleiechmann & Mugel. LLP 
STREET: 39 Stata Street 
CITY: Rocheater 
STATB: New York 
COUNTRY: USA 
ZIP: 14614-1310 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/943,667 
FILING DATE: 03-OCT-1997 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: GB 9705462.1 
FILING DATE: 17-MAR-1997 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER : GB 9712826.8 
PILING DATE ; 19-JUN-1997 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER : GB 9717253.0 
FILING DATE: 15-AUG-1997 
ATTORNEY/AGENT INFORMATION: 
NAME: Bremen, Suaan J 
REGISTRATION NUMBER: 34,103 
RE FERENCB / DOCKET NUMBER: 87792. 97R421 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 716-262-3640 



TELEFAX: 716-262-4133 
INFORMATION FOR SBO ID NO: 2 
SEQUENCE CHARACTERISTICS: 
LENGTH : 752 amino acids 
TYPE: amino acid 
STRANDEDNBSS : single 
TOPOLOGY: linear 
MOLECULE TYPB: protein 
HYPOTHETICAL: NO 
ANT I -SENSE : NO 
ORIGINAL SOURCE: 

ORGANISM: Drosophila 
US-08-943-667-29 



Query Match 38.9%; 
Best Local Similarity 40.1%; 
Matchea 258; Conaarvative 8 



Score 1144.5; DB 2; Length 
Pred. NO. 8.2e-91; 
i; Mismatches 172; Indole 



5 TSQLYDAVP I QSSWL CSCPS-PSMVRTQTBSSTPPGI PGGSRQGPAMDO 53 

II I M M I II l M III :| 
69 TSPLAAAVASNSSSVATTPHCRMLHNCSLQQYQNDI RQQTB ILDM 113 



oy 

Db 

Oy 
Db 
Oy 



54 TAAEPRPGAGSLQHAQPP PQPRKKRPBDFKFGKi LGEGS 92 

I M II II MM I II II • Mill 

114 LRHBHOQaYO^QQQQQQPQQQQEQQQQQEQSQQQQQLQNPAPRRS-PhOPIFGRYIGra^ 172 

93 FSTWLARELATSREYAI KI LEKRH 1 1 KENKVPYVTRERDVMSRLDH - PFFVKLYFTFQD 151 
:| I II : IIMIM III MM I M MMM -Mill Mil 
173 YSIVYLAVDIHSRREYAI KVCEKRLI LJR.ERKQDYI KREHEVKHQKTKVPGFVNLSCTFQD 232 



152 DBKLYPCLSYAKNGBLLKYIRKIGSFDB 

III = Mh MM II -llll 
233 ARSLYFVMTYARKGDMLPY I NRVGSFI 



•CTRFYTABIVSALEYLHGKDIIHRDLKPENI 211 
III I II- I MM : -llllillll 

^LKPBNI 292 



212 LLNEDMHIQITDFGTAKVL 

IIMIM I IIMIIM 

293 LLDEDMHTLIADFGSAKVMTAHERALATEHCSBl 



Oy 

Db 
Oy 
Db 

Qy 

Db 

Qy 

Db 

Oy 



231 

353 ! 



II M I I 



- S FVGTAQYVS PELLTB KSACKSSDLHAL 268 

IIMIIIIIIIMI -Mill 

. 412 



269 GCI IYQLVAGLPPFRAGNEYLIFQKIIKLETDFPEKFFPKARDLVEKLLVLDATKRLGCE 328 

IIMII-IIIIMI MMII-M. |||: I I III III :| Ml : 
413 GCI VYQMI AGLPPFRGSNDYVI FKE I LZKAVDPPQGFDKDABDLVRKLLRVDPRDRLGAQ 472 

329 EMEG- YGPLKAHPFFESVTWENLHQQTPPKLTAYLPAMSEDDEDCYGNY • - -DNLLSQFG 384 

« I I "Mill • 1= I lllll t Ml :|: HI :\ :| 
473 DE FGYYES I RAH PFFAG I DWQTLRQQTP P P I YPYL PGVSQ - DED FRS SYTVPGDLEPGLD 531 

385 CMQVSSS SSSHS LSASDTG LPQRSGSN I EQY I HDLDSNS FELDLQFS ED E KRLLLE KQAG 444 

hi h I =1 : : llhh I >*| =| 

532 ERQISRLLSABLGVGSSVAMPVKRST AKNSFDLN DABKLQRLEQQK 577 

44S GNPWHQFVENNLILKMGPVDKRKGLFARRRQLLLTEGPHLYYVDPVNKVLK^ 504 

Ml I i MM I I Mf 1 1 1 1 1 1 1 MJ llll II I hill : IIMIM M 

578 TD KWHVFADGEV I L KKD FVNKRKGL FARKRMLLLTTG P RL I Y I D PVQM I KXGB I PWS PDL 637 



Qy 505 RPEAKNF KTFFVHTPNRTYYLMD PSGNAH KWCRKI QEVWRQR Y 547 

i i iiii illinium iiiii h i i i 

Db 638 RAEYKNPKI FFVHT PNRTYYLDDPEGYA I HWSEA I ENKRKLA Y 680 



RESULT 7 

US-09-20S-658-160 

; Sequence 160. Application US/09205658 

; Patent No. 6861256 

; GENERAL INFORMATION: 

; APPLICANT: Ruvkun, Gary 

; APPLICANT: Ogg, Scott 

! TITLE OF INVENTION: THERAPEUTIC AND DIAGNOSTIC TOOLS FOR 

; TITLE OF INVENTION: IMPAIRED GLUCOSE TOLERANCE CONDITIONS 

; PILE REFERENCE: 00786/351004 

; CURRENT APPLICATION NUMBER: US/09/205,658 

; CURRENT FILING DATE: 1998-12-03 

; EARLIER APPLICATION NUMBER: 08/857,076 

; EARLIER FILING DATE: 1997-05-15 

; EARLIER APPLICATION NUMBER: 08/888,534 

; EARLIER FILING DATE: 1997-07-07 

; EARLIER APPLICATION NUMBER: US9B/10080 

; EARLIER FILING DATE: 1998-05-15 

; NUMBER OF SEQ ID NOS: 328 

; SOFTWARE: FastSEQ Cor Windows Version 4.0 

; SEQ ID NO 160 

; LENGTH: 636 

; TYPB: PRT 

; ORGANISM : Caenorhabditia elegans 
US-09-205-658-160 

Query Match 32.0%; Score 940; DB 2; Length 636; 

Best Local Similarity 36.9%; Pred. No. 4.2e-73; 

Matches 200; Conservative 101; Mismatches 159; Indels 82; Gaps 11; 
Qy 62 AGSLQHAQPPPQPRKKRPEDFKFGKI L£EGS FSTWLARELATSREYAI KI LEKRH 1 1 KB 121 

I I- I MUM MIMM I IMM MMMM - « 

Db 54 AKSIKEGCP KRTSNDFMFLQSMG EGAYSGVFRCREVATDAMFAVKVLQKSYLNRH 108 

Qy 122 NKVPYVTRERDVMSRLD HPFFVKLYPTFQDDEKLYFGLSYAKNGELLKYIRKIGS 176 

h . Ill"" I III :|l I I -II . Mhl : • II 

Db 109 QKMDAI IREKNILTYLSQECGGHPFVTQLYTHFHDOARIYFVIGLVENGDLGESLCHFGS 168 

Qy 177 F0BTCTRFYTABIV6ALEYLHGKGIIHRDLia?ENILUIEDMHIQITDFGTAKVLS 231 

II -h II- l-ll hllhlhhh :| II llllhh 

Db 169 FDXLTSKFFASB I LTGLQFLHDNKI VHRDMKPDNVL I QKDGH I L I TDFGSAQAFGGLQLS 228 

Oy 232 PES KQARANS FVGTAQYVS P ELL TE KSAC 260 

II II Ml I II MIMM : 
Db 229 QBG FTDANQASSRSSDSGS P P PTRFYSDBBVP EENTARRTT PVGTAL YVS PEMLADGDVG 288 

Qy 261 KSSDLWALGCI I YQLVAGLPPFRAGNBYL I FQKI I KLEYDFPEKFFPKARDLVBKLLVLD 320 

MM MIMM Ml lllll hi -I M- III I M -= III I 

Db 289 POTDIWGLGCILFQCLAGQPPFRAVNQYHLLKRIQELDFSFPEGFPBBASBI IAKILVRD 348 

Oy 321 ATKRLGCEEMEGYG PLKAH PFFESVTWENLHQQTP P KLTA YL PAMSEDDEDCYGNYDNL - 379 



M: M I II IMM I h II I IMM M I h 

Db 349 PSTRITSQB LMAHKFFENVDWVNIANIKPPVLHAYIPATFGBPB YYSNIG 398 

Qy 380 LSQ FGCMQVSSSSS SHSLSASDTGLPQRSGSN I EQ Y I HDLDSNS FBLDLQF 430 

I - : : || I I MM - - : 
Db 399 PVBPGLDDRALFRLMNLGNDASASQPSTFRP SNVEHRGD P PVSB I APRA N 448 

Qy 431 SEDEKRLL LB KQAGGNPWH QFVENNL I LKMG PVD KRKGLFARRRQLLLTEG PHL Y 485 

II M MM MM I MMM I -MMIIIMI III 

Db 449 SEAEKNRAARAQKLSBQRVKNPFH I FTNNSL I LKQGYLBKKRGLFARRRMFLLTEGPHLL 508 

Qy 486 YVDPVNKVLKGB I PWSQB LRP EAKNFKTFFVHTPNRTYYLMD PSGNAH KWCRK I QEVWRQ 545 

hi I MIIMIM - I II III: HI | | MM I I h 

Db 509 Y I DVPNLVLKGEVPWTPCMQVSLKNSGTFFIHTPNRVYYLFDLBKKADSWCKAI NOV • RK 567 

Qy 546 RY 547 

II 

Db 568 RY 569 



RESULT 8 

US-09-205-65B-159 

j Sequence 159, Application US/09205658 

; Patent No. 6861256 

; GENERAL INFORMATION: 

; APPLICANT: Ruvkun, Gary 

; APPLICANT: Ogg, Scott 

; TITLE OF INVENTION: THERAPEUTIC AND DIAGNOSTIC TOOLS FOR 

; TITLE OF INVENTION: IMPAIRED GLUCOSE TOLERANCE CONDITIONS 

j FILE REFERENCE: 00786/351004 

; CURRENT APPLICATION NUMBER: US/09/205,658 

; CURRENT FILING DATE: 1998-12-03 

; EARLIER APPLICATION NUMBER: 08/857,076 

; EARLIER FILING DATE: 1997-05-15 

; EARLIER APPLICATION NUMBER: 0B/8BB,534 

; EARLIER FILING DATE: 1997-07-07 

; EARLIER APPLICATION NUMBER: US98/100BO 

; EARLIER PILING DATE: 1998-05-15 

; NUMBER OF SEQ ZD NOS: 328 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SE0 ID NO 159 

; LENGTH: 632 

; TYPB: PRT 

; ORGANISM: Caenorhabditis elegana 
US-09-205-658-159 

Query Match 31.8%; Score 935; DB 2; Length 632; 

Best Local Similarity 37.2%; Pred. No. l.le-72; 

Matches 202; Conservative 101; Mismatches 152; Indels 88; Gaps 13; 

Qy 62 AGSLQHAGPPPQPRiattPBDFKFOKILGEGSFSTVVLARBlA^^ 121 

I In I h II I i MIMM I IMM MMMM - = 

Ob 54 AKSIKEGCP IO?TSNDFMFLQSMGEOAYSQVFRCRBVATDAMFAVKVLQKSYLNRH 108 

Oy 122 NKVPYVTRERDVMSRLD H PFPVKLYFTFQDDBKLY FGLSYAKNGELLKY I RKIGS 176 

h : II--: I III II I | -|| : MMI ■ I II 

Db 109 QKMDAI IRBKinLTYLSQEOOGHPFVTQLYTHFHDOARIYPVIGLVBIJCIDI^BSLCHFGS 168 



Oy 


177 


Db 


169 


Oy 


231 


Db 


229 


Oy 


263 


Db 


289 


oy 


323 


Db 


34 9 


Oy 


380 


Db 


399 


oy 


430 


Db 


444 


Oy 


485 


Db 


504 


Oy 


545 


Db 


563 



II i>|. :||:: h'll hllhlhhh M II llllhh 



hi lllh = l = 11 Hill hi 



-ESKC*RANSPVGTAQYVSPBLLTBICSACKS 262 
I II =11111 llllhl : 



= 1 H» III I = 1 



h :| 



I II Mil I h III 11 = 11 



hll 



I h 

-YYSNIGPV 398 



- LSQFGCMQVSSSSSSHSLSASDTGLPQRSGSN I EQ Y I HDLDSNS PELDLQ 429 

: | | : : III I Ihl :« - • 
IDRALFRLMNLGNDA SASQPSTP SNVEHRGDPPVSBIAPRA 443 



II II Ihl Ihl I 1 = 1111 I = = h:||||||| llllllll 



hi I 



= 11 = 



I II llhillll III I I 



lh I =1 1 

-R 562 



RESULT 9 

US-09-538-092-216 

; Sequence 216, Application US/09538092 

; Patent No. 6753314 

; GENERAL INFORMATION: 

; APPLICANT: Giot, Loic 

; APPLICANT: Mansfield, Traci A. 

TITLE OF INVENTION: Protein-Protein Complexes and Method of Using Same 

FILE REFERENCE: 15966-542 

CURRENT APPLICATION NUMBER: US/09/538,092 
CURRENT FILING DATE: 2000-03-29 
PRIOR APPLICATION NUMBER: 60/127,352 
PRIOR FILING DATE: 1999-04-01 
PRIOR APPLICATION NUMBER: 60/178,965 
PRIOR FILING DATE : 2000-02-01 
NUMBER OF SEQ ID NOS: 1387 
SOFTWARE: CuraPatSeqFormatter Version 0.9 
SEQ ID NO 216 

LENGTH : 766 

TYPE: PRT 

ORGANISM: Saccharomyces cerevisiae 
FEATURE : 

NAME/ KEY: miac_feature 



; LOCATION: <0 )...«» 

; OTHER INFORMATION: Polypeptide Accession Number YOR490C 
US-09-S38-092-216 

Query Match 24.2%; Score 712; DB 2; Length 766; 

Best Local Similarity 30.31; Pred. No. 4.1e-53; 

Matches 187; Conservative 87; Mismatches 154; Indels 190; 



Oy 


27 


Db 


22 


Oy 


68 


Db 


79 


Oy 


96 


Db 


139 


Oy 


15S 


Db 


199 


oy 


215 


Db 


259 


Oy 


264 


Db 


319 


Oy 


324 


Db 


379 


Oy 


370 


Db 


433 


Oy 


373 


Db 


493 


Oy 


430 


Db 


544 


oy 


470 


Db 


589 



I llh 



I I I I I 



-PAMDOT- 

h I 



I h'l I 



II 1 1 



III I II 



= =lhht I nil' II III h 



-AABPRPGAGSLQH 67 
I II =11 

-MQH 78 

-DFKFGKILGEGSFST 95 
llllh Ihlhh 
tDPKFGBQLGDGSYSS 138 



Ihllllh 



I 1 = 1 II =111 hi 



h= II lllllhllllllh 



::| : = =llllllhl I 



• -ARANSFVGTAQYVSPELLTEKSACKSS 263 
h= lllllhlllllll = 



hll llhlh = ll 1 1 1 I Mil llh = h = l I I 



Mlhllll I 



PLKAHPFFESVTWE- -NLHQOTPPKLTAY LPAMSEDD 369 

= 111 II I =1 := lh' I I =11 I 

-IKAHLFFHEVNFEIKSVTODNPPBIQPYXINAEAMKPLQKVSBSDTTV 432 



433 KMANLOLAGNCiHADTPI^APAATSQEHSVISMTAATAAFNKDYTSQPKLGSKSSTSVRSA 492 



11=1 



= hll h 



= 1 II hi =1 II h 



:|: >i| 



•GNPWHQFVENNLILKMGPVDKRKGL 469 
II 1=11 

LDIBPPTOSAGK 588 



1= I I =1 



RESULT 10 

US-09-24B-796A-20495 

; Sequence 20495, Application US/0924B796A 
; Patent No. 6747137 



i GENERAL INFORMATION: 

; APPLICANT: Keith Weinstock et al 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO CANDIDA 
ALBICANS 

; TITLE OF INVENTION: FOR DIAGNOSTICS AND THERAPEUTICS 
; PILE REFERENCE: 107196.132 

; CURRENT APPLICATION NUMBER: US/09/248 , 796A 

; CURRENT FILING DATE: 1999-02-12 

; PRIOR APPLICATION NUMBER: US 60/074,725 

; PRIOR FILING DATE: 1998-02-13 

; PRIOR APPLICATION NUMBER: US 60/096.409 

; PRIOR FILING DATE: 1998-08-13 

; NUMBER OF SEQ ID NOS: 28208 

; SEQ ID NO 20495 

; LENGTH: 901 

; TYPE: PRT 

ORGANISM : Candida albicans 
US-09-248-796A-204 95 

Query Match 22.3%; Score 655.5; DB 2; Length 901; 

Best Local Similarity 37.1%; Pred. No. 4.4e-4S; 

Matches 144; Conservative 69; Mismatches 116; Indels 59; Gaps 10; 

Oy 59 RPGAGSLQHAQPPPQP RKKRPEDFKFGKILGEGSFSTVVLARELATSREYA 109 

= 1 III M hUl =1111=111 I = hi II 

Db 53 Q PSNRSTQQOQTNSQPQSQY S 0LAAKRRSARD YQFGAKI GEGSYSTVYSALD KYTNRTYA 112 

Qy 110 I KI LEKRHI I KENKVPYVTRERDVMSRL- -DHPFPVKLYFTFQDDSKLYFGLSYAKNGEL 167 

Ihl lllhlllh II h = = ll II hlhlllh hi I =h III 

Db 113 I KVLSKRH I VKENKI KYVN I E KTTLNRLGQQH PG I VQLYYTFQDESSLFFVLDFAEYGBL 172 

Qy 168 LKY I RKIGSFDETCTRFYTAE I VSALEYLHGKG I IHRDLKPEN I LLNEDMH I QITDFGTA 227 

I III II 1= =11 =h 1= = = l I h I I I I I I I I I I h I = = = 11111 I 
Db 173 LS 1 1 RKFGS LS BSVLKFYMCQ 1 1 DAVRF IHL KGV I HRDLKP EN I LVGHDFNLKI TD FGAA 232 

Oy 228 KVL SPB-SKQARANSFVGTAQYVSPBLLTEKSACKSSDLWAL 268 

hi III I I lllllhlllllll Ihlh 

Db 233 KLLGNHEE FNGEKI DYNSVDASPBVSDTDRKGS FVGTAEYVS PELLKHN 1 00 FESD I WA I 292 

Qy 269 GCIIYQLVAGLPPFRAGNBYLIFOKI IKLEYDF- -PEKFFPKARDLVEKLLVLDATKRLG 326 

llhll hill: III h||| ::| . | | h= =h » « I 
Db 293 GCI LYQFFNGVPPFKGSTBYLTFEKI I NVDYSYKSP I PLPPDVI QL IDL I LI SEPSARAT 352 

Qy 327 CSEMEGYGPLKAHPFFESVTWEN- - -LHQQTPPKLTAY LPAMSEDDEDCYGN 375 

: * i| . h= : : |: h =1 I == I 
0b 353 IPH IMSSRWPSDIPWDOQNFIWHRKVPRFEAFGAITPSPAMPPMMKN GT 401 

Oy 376 YDNLLSQFGCMQVS SSSSSHS LSASDTG 403 

h i hi • lh I 

Ob 402 NRNVNKSYSYHQLQS QILASECG 424 



RESULT 11 
US-09-20S-6S8-173 

; Sequence 173, Application US/09205658 
; Patent No. 6861256 
; GENERAL INFORMATION: 



; APPLICANT: Ruvkun, Gary ■ 

; APPLICANT: Ogg, Scott 

; TITLE OP INVENTION: THERAPEUTIC AND DIAGNOSTIC TOOLS FOR 

; TITLE OF INVENTION: IMPAIRED GLUCOSE TOLERANCE CONDITIONS 

; FILE REFERENCE: 00786/351004 

; CURRENT APPLICATION NUMBER: US/09/205 , 658 

; CURRENT FILING DATE: 1998-12-03 

; EARLIER APPLICATION NUMBER: 08/857,076 

; EARLIER FILING DATS: 1997-05-15 

; EARLIER APPLICATION NUMBER: 08/888,534 

; EARLIER FILING DATE: 1997-07-07 

; EARLIER APPLICATION NUMBER: US98/10080 

; EARLIER FILING DATE: 1998-05-15 

; NUMBER OF SEQ ID NOS: 328 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 173 

; LENGTH: 113 

; TYPE: PRT 

; ORGANISM: Mus iraisculus or Homo sapiens 
US-09-205-658-173 

Query Match 20.9%; Score 614; DB 2; Length 113; 

Best Local Similarity 100.0%; Pred. No. 8.4e-46; 

Matches 113; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 263 SDLWALGCI IYQLVAGLPPFRAGNEYLIFQKI IKLEYDFPEKFFPKARDLVEKLLVLDAT 322 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 1 SDLWALGCI IYQLVAGLPPFRAGNEYLIFQKI IKLEYDFPEKFFPKARDLVEKLLVLDAT 60 

Qy 323 KRLGCE EMEGYG PLKAH P F FESVTWENLHQQTP PKLTAYLPAMS EDDEDCYGN 375 

IIIIIMIIMIIIIIIIIIIIMMIlllllllllll imilllllllMI 

Db 61 KRLGCE EMEGYGPLKAHPFFESVTWENLHOQTPPKLTAYLPAMS EDDEDCYGN 113 



RESULT 12 
US-09-205-658-202 

; Sequence 202, Application US/09205658 

; Patent No. 6861256 

; GENERAL INFORMATION: 

; APPLICANT: Ruvkun, Gary 

; APPLICANT: Ogg, Scott 

; TITLE OP INVENTION: THERAPEUTIC AND DIAGNOSTIC TOOLS FOR 

; TITLE OF INVENTION: IMPAIRED GLUCOSE TOLERANCE CONDITIONS 

; FILE REFERENCE : 00786/351004 

; CURRENT APPLICATION NUMBER: US/09/205,658 

; CURRENT FILING DATE: 1998-12-03 

; EARLIER APPLICATION NUMBER: 08/857,076 

; EARLIER FILING DATE : 1997-0S-1S 

; EARLIER APPLICATION NUMBER: 08/888,534 

; EARLIER PILING DATE: 1997-07-07 

; EARLIER APPLICATION NUMBER: US98/10080 

; EARLIER FILING DATE: 1996-05-15 

; NUMBER OF SBO ID NOS : 328 

; SOFTWARE: PastSEO for Windows Version 4.0 

; SEQ ID NO 202 
; LENGTH: 104 
TYPB: PRT 



; ORGANISM: Homo sapiens 
US-09-205-658-202 

Query Match 19.5%; Score 573; DB 2; Length 104; 

Beat Local Similarity 100.0%; Prod. No. 2.8e-42; 

Matches 104; Conservative 0; Mismatches 0; Indole 0; Gaps 0; 

Qy 439 I^KC*OGJIPWHQFVENNLILKMGPVD]C*IQ3LFARR^^ 498 

1 LBKcJLIi!^ 60 

Oy 499 PWSQELRPEAKNFKTFFVHTPNRTYYLMDPSGNAHKyCRKIQEV 542 

lllllllllillllllllllllllllllltllltlltlllllll 
Db 61 PWSQSLRPBAKNFKTFFVHTPNRTYYLMDPSGNAHKWCRKI QEV 104 



RESULT 13 

US-09-949-016-11701 

; Sequence 11701, Application US/09949016 

; Patent No. 6812339 

; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al. 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OP INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 

THEREOF 

,- PILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/949,016 

I CURRENT FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

I PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS: 207012 

j SOFTWARE: FastSEO Cor Windows Version 4.0 

; SEQ ID NO 11701 

LENGTH: 420 

TYPE: PRT 

ORGANISM: Human 
US-09-949-016-11701 

Query Match 19.4%; Score 571. 5; DB 2; Length 420; 

Beat Local Similarity 35.5%; Pred. No. 3e-41; 

Matches 127; Conservative 67; Mismatches 139; Indels 25; Gaps 7; 

Oy 20 IX^CPSPSMVRTCTES^PPGIPGGSRCGPAMDGTAAEPRPCJVGSLQHAQPPPQPRKKRP 79 

I II II : 11 = II II =1 I h I 
Db 52 LARCPGPECVPMEA PGLAQAAAAS SDSRKVAEETPDGAPAL CPS P SALS PEP 103 

Qy 80 EDFKFGKI LGEGSFSTWLARELATSREYAIKILEKRHI IKENKVPYVTRERDVM 134 

:|| :| }:| | | :| :|:|:: :|: : :| |: |: 

Db 104 PVYS LODFDTLATVGTGT FGR VHLVKE KTAKH FFALKVMS I PDV I RLKOEQHVHNE KSVL 163 

Qy 135 S RLDH PF FVKL Y FTFQDDE KL Y FG LS YAKNGELL KYIRKIGSFD BTCTRFYTAE I VSALB 194 

• III I: II : I III hi II I 11 = 111= hi 

Db 164 KEVSHPFLIRLFWTWHDERFLYMIjMEYVPGGELFSYLRNRGRFSSTTGLFYSAEI ICAIB 223 



Qy 195 YLHGKQI IHRDLKFENILLNESMHIQITDFGTAKVLSPBSKQARANSFVGTAQYVSPBIX 254 

III I h = lllllllllh I ll==llll II I I : II <|ii||:> 

Db 224 YLHSIG3IVYRDLKPENILLDRJXJHIKLTDFGFAKKLVD RTWTLCGTPBYLAPBVI 278 

Qy 255 TBKSACKSSDLMALGCI I YQLVAGLPPFRAGNEYLI FQKI I KLEYOFPBKFFPKARDLVB 314 

I »: I III! >l I III I • hlll> > Ill = lh = 

Db 279 QS KDHGRAVDWWALG I L I FBHLSG P PPFFDDNPFG I YQ KI LAGKI D FPRHLD FHVKDL I K 338 

Oy 315 lOAiVLOATKRLQCBEMEGYCPLKAHPFFESVTWENLHQQ- -TPPKLTAYLPAMSEDDE 370 

1111 = 1 hill I 'I I »l II II i h II :| t: | j 

Db 339 lOAVVDRTRRLO-NMJOtfGANDVKHHlWPllSVDWBAV^ IVPKIAGDGD 391 



RESULT 14 
US-09-031-295-2 

; Sequence 2, Application US/0 90312 95 

; Patent No. 6326181 

; GENERAL INFORMATION: 

; APPLICANT: LANG, Florian 

; APPLICANT: WALD SOGER, Tubingen 

; TITLE OF INVENTION: CELL VOLUMB- REGULATED HUMAN KINASE H-SGK 

; NUMBER OP SEQUENCES: 4 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: FOLEY fc LARDNBR 

; STREET: 3000 K Street. N.W. 

; CITY: Washington 

; STATE : D.C. 

; COUNTRY: U.S.A. 

; ZIP: 20007-5109 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS -DOS 

; SOFTWARE: Patent In Release 11. 0, Version #1.30 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/09/031.295 

; FILING DATS: 26-FEB-1998 

; CLASSIFICATION: 435 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER : DE 197-08-173.8 

; FILING DATE: 28-FEB-1997 

; ATTORNEY/AGENT INFORMATION: 

; NAME: Sandercock, Colin G. 

; REGISTRATION NUMBER: 31,298 

; REFERENCE/ DOCKET NUMBER: 058315/0123 

; TELECOMMUNICATION INFORMATION: 

j TELEPHONE: (202) 673-5300 

; TELEFAX: (202) 672-S399 

; INFORMATION FOR SEQ ID NO: 2: 

; SEQUENCE CHARACTERISTICS: 

; LENGTH: 431 amino acids 

; TYPE; amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 

US-09-031-295-2 



Query Match 19.1%; Score 562; DB 2; Length 431; 

Best Local Similarity 37.5%; Pred. No. 2.1e-40; 

Matches 123; Conservative 65; Mismatches 116; Indels 24; 



Gaps 



Qy 


59 


Db 


62 


oy 


106 


Db 


122 


oy 


165 


Db 


182 


oy 


22S 


Db 


242 


oy 


285 


Db 


300 


oy 


345 


Db 


358 



:| III I 



Ihhhh hh 



Ihl 



- R PED FKFG K I LGEGS FSTWLARBLATS 105 

:| II I h'hlll hill I 



III I hhll Hill I I 



III 



I I 



I I III III III III h = lllllllllh II =111 



:| || :|::||:| :: :: | | || ,.|,,, Mill « 
TFCGT PEYLAPEVLH KQPYDRTVDWWCLGA VLYEML YGLP PFYS 299 



II hi II I Mil 



= hl II 



III 



=1=1 =1 

-FNPNVSGPNE 381 



RESULT 15 
US-10-000-039-2 

i Sequence 2, Application US/10000039 
; Patent No. 6855520 

INFORMATION: 
APPLICANT: LAND. Florian 

WALDBGGBR, Tubingen 
TITLE OF INVENTION: CELL VOLUMB -REGULATED HUMAN KINASE H-SGK 
NUMBER OF SEQUENCES : 4 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: FOLEY t LARDNBR 
STREET: 3000 K Street, N.W. 
CITY: Washington 
STATE: D.C. 
COUNTRY: U.S.A. 
ZIP: 30007-5109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE: Patent In Release fll.o, Version 41.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/000,039 
FILING DATE: 04-Dec-2001 
CLASSIFICATION: < Unknown > 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/031,295 
FILING OATS: 26-FBB-1998 



APPLICATION NUMBER: DB 197-08-173.8 
FILING DATE: 28-FEB-1997 
ATTORNEY/ AGENT INFORMATION: 

NAME: Sandercock, Colin G. 
REGISTRATION NUMBER: 31,2 98 
REFERENCE/ DOCKET NUMBER: 058315/0123 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (202) 672-5300 
TELEFAX: (202) 672-5399 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH : 431 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-10-000-039-2 



Query Match 
Beat Local Similarity 
Matches 123 ; 



19.1%; Score 562; DB 2; Length 431; 
37.5%; Pred. No. 2.1e-40; 

65; Mismatches 116; Indels 24; 



Gaps 



oy 


59 


Db 


62 


Qy 


106 


Db 


122 


oy 


165 


Db 


182 


oy 


225 


Db 


242 


Oy 


285 


Db 


300 


oy 


34S 


Db 


358 



=1 I =1 III I 



Ihhhh 1 = 1 = 



11 = 1 



-RPEDFKFGKILGEGSFSTWLARELATS 105 
= 1 II I l= = l = MI hill I 



= III I hhll =1111 I I 



III 



I I 



I I IN III III III h:||llliillh II =111 



= 1 II =h = lhl 



I II ==! = = = Mill 



II hi II I Mill 



= hl II 



l = :| 



III 



Search completed: June 27, 2006, 01:38:02 
Job time : 53 sees 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Bioccaleration Ltd. 




OH protein - protein search, uaing aw model 

Run on: June 27, 2006, 01:28:03 j Search time 290 Seconda 

(without alignment a) 
1725.668 Million cell updatea/aec 

Title: US-10-6B9-S76-1 
Perfect acore: 2941 

Sequence: 1 MARTTSQLYDAVPIQSSWL RKIQEVWRQRYQSHPDAAVQ 556 

Scoring table: 8LOSUN62 

Gapop 10.0 , Gapext 0.5 

Searched: 2849598 seqs. 925015592 reaiduee 

Total number of hite aatiafying choaen parametera: 2849598 

Minimum DB eeq length: 0 

Maximum DB eeq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Liating first 45 aumraariea 

Database : Uni Prot_7 . 2 : • 

1 : uniprot_sprot : • 
2: uniprot_trembl : * 

Pred. No. ia the number of results predicted by chance to have a 
acore greater than or equal to the acore of the reault being printed, 
and ia derived by ana lye ia of the total acore distribution. 



SUMMARIES 

I 



Result 

No. 


Score 


Query 

Match Length DB 


ID 


Deacription 


1 


2941 


100.0 


556 


1 


PDPK1 HUMAN 


015530 homo sapien 


2 


2941 


100.0 


556 


2 


Q6FI20 HUMAN 


Q6fi20 homo eapien 


3 


2926 


99.5 


556 


2 


0S3HJ9_HUMAN 


Q53hj9 homo sapien 


4 


2787.5 


94.8 


559 


1 


PDPK1_RAT 


055173 rattua norv 


5 


2769.5 


94.2 


559 


1 


PDPK1 MOUSE 


Q9z2a0 sue mueculu 


6 


2764.5 


94.0 


559 


2 


Q3UHZ0_MOUSB 


Q3uhz0 mua muaculu 


7 


2758.5 


93.8 


559 


2 


Q3UGN6~MOUSB 


Q3ugn6 mua muaculu 


8 


2732.5 


92.9 


551 


2 


Q610Z4 MOUSE 


QB10z4 mua muaculu 


9 


2626.5 


89.3 


532 


2 


QBK3L3 MOUSE 


QBk313 mua muaculu 


10 


2626.5 


89.3 


556 


2 


0SF3U4_CHICK 


Q5f3u4 gallua gall 


11 


2560.5 


87.1 


522 


2 


Q3TRL2~MOUSB 


Q3trl2 mua muaculu 


12 


2391.5 


81.3 


sse 


2 


Q4S4R0 TETNG 


Q4a4r0 tetraodon n 


13 


2336 


79.4 


492 


2 


Q59BH6~HUMAN 


Q59eh6 homo sapien 


14 


2228 


75.8 


537 


2 


Q6NZV1 BRARE 


Q6nzvl brachydanio 


15 


2202.5 


74.9 


429 


2 


Q9BRDS HUMAN 


Q9brd5 homo aapien 



2093.5 


71.2 


434 


3 


Q3UEW8_MOUSS 


Q3uew8 


mua muaculu 


1933 


65.7 


367 


2 


Q9UPJB~HUMAN 


Q9upj8 


homo aapien 


1826 


62.1 


396 


3 


Q6ALA3~HUMAN 


Q6ala3 


homo aapien 


1710 


58.1 


319 


2 


Q9UPJ7~HUMAN 


Q9upj7 


homo aapien 


1622. S 


55.2 


S71 


2 


Q76BX2 ASTPB 


Q76bx2 


asterina pe 


1431 


48.7 


822 


2 


Q9BMX6~APLCA 


Q9brax6 


aplyaia cal 


1144.5 


38.9 


B36 


1 


PDPK1 DROMB 


Q9w0vl 


droaophila 


1037 


34.9 


484 


2 


Q7PUX5_ANOGA 


07pux5 


anophelea g 


940 


32.0 


636 


1 


PDPK1_CAEEL 


Q9ylj3 


caenorhabdi 


939 


31 . 9 


813 


2 


QSB8M0_EMENI 


Q5b8ta0 


aspergi 1 lua 


930 


31.6 


686 


2 


Q54TW2~DICDI 


QS4tw2 


dictyostel i 


928 


31.6 


850 


2 


Q2UBU7~ASPOR 


Q2ubu7 


aspergi 1 lua 


908. 5 


30 . 9 


B39 


2 


Q4WYD7~ASPFU 


Q4wyd7 


aspergi 1 lua 


903 


30.7 


630 


2 


Q5WNC8~CAEBR 




caenorhabdi 


873.5 


29.7 


592 


1 


KSG1_SCHP0 


012701 


echizosacch 


868.5 


29.5 


812 


2 


Q4KWD3 OIBZB 


Q4hwd3 


gibber el la 


868.5 


29. S 


922 


2 


Q873K1~NEUCR 


Q873kl 


neurospora 


833.5 


28.3 


1542 


2 


Q6CFM7~YARLI 


Q6cfm7 


yarrowia li 


832 


28.3 


488 


2 


Q9LZ74_ARATH 


Q91z74 


arabidopsia 


832 


28.3 


491 


2 


09XF67_ARATH 


Q9xf67 


arabidopeia 


821 


27. 9 


498 


2 


Q8RUB8 ORYSA 


Q8rue8 


oryza aativ 


820 


27.9 


483 


2 


Q9SQY4 ARATH 


Q9eqy4 


arabidopeia 


820 


27.9 


486 


2 


04V3C8 ARATH 


Q4v3c8 


arabidopaia 


816.5 


27.8 


494 


2 


QSI6B8~LYCBS 


Q5i6e8 


lycopersico 


800 


27.2 


908 


2 


054PK9 DICDI 


054pk9 


dictyostel i 


795. S 


27.0 


1062 


2 


06 B I KB DEBHA 


Q6bik8 


debaryomyce 


763.5 


26.0 


944 


2 


Q5A3P6~CANAL 


Q5a3p6 


Candida alb 


760. S 


25.9 


1153 


2 


Q5A3V9 CANAL 


Q5a3v9 


Candida alb 


74B.S 


25.5 


1081 


1 


KOK0 YEAST 


Q12236 


eaceharorayc 


733 


24.9 


408 


2 


Q3B9L6 ARATH 


Q3e916 


arabidopsia 



ALIGNMENTS 



RESULT 1 
PDPK1_HUMAN 

ID PDPK1 HUMAN STANDARD; PRT; 556 AA. 

AC O15S30; 

DT 18-OCT-2001, integrated into UniProtKB/Swiaa-Prot . 

DT 01- JAN- 1998, eequence vera ion 1. 

DT 07-MAR-3006, entry veraion 58. 

DB 3 -phoaphoinoaitide -dependent protein kinase 1 (EC 2.7.1.37) (hPDKl). 

GN Name-PDPKl; Synonyms- PDK1 ; 

OS Homo eapiena (Human) . 

OC Bukaryota; Metazoa; Chordata; Craniate; Vertebrate; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglirea; Priraatea; Catarrhini; Horainidae; 

OC Homo. 

OX NCBI TaxID-9606; 

RN (1] 

RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1). 

RX MEDLINE- 9725074 9; PubMed-9094314 ; DOI-10 . 1016/S0960- 9822 (06)00122-9; 

RA Aleeai D.R., James S.R., Dovnea CP., Holmes A.B., Gaff nay P.R.J. , 

RA Reeae C.B., Cohen P.; 

RT "Characterization of a 3 -phosphoinoai tide -dependent protein kinase 

RT which phoaphorylatea and activatoe protein kinase B alpha."; 

RL Curr. Biol. 7:261-269(1997). 



RN [2) 

RP NUCLEOTIDE SEQUENCE (MRNAJ (ISOFORM 1). 

RX MEDLINE-98035195; PubMed-9368760 ; DOI-10 . 1016/S0960-9822 (06) 00336-8 ; 

RA Aleasi D.R., Deak M. , Caaamayor A., Caudwell F.B., Morrice N.A. , 

RA Norman D.G. , Gaffney P.R.J. , Reeae C.B., MacDougall C.N., Harbiaon D., 

RA Aahworth A., Bownea M. ; 

RT "3-phosphoinoaitide-dependent protein kinaae-1 (PDK1) : structural and 

RT functional homology with the Droaophila DSTPK61 kinaaa."; 

RL Curr. Biol. 7:776-789(1997). 

RN |3) 

RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1 AND 3). 

RC TISSUE-Myeloid; 

RX MEDLINE-98111410; PubMed-9445477 ; DOI-10 . 1126/scienee. 279. S351 . 710; 

RA Stephens L.R.. Anderson K.B., Stokoe D. , Brdjument-Bromage H., 

RA Painter G.F., Holmes A.B., Gaffney P.R.J. , Reese C.B., McCormick P., 

RA Tempat P., Coadwell W.J. , Hawkina P.T.; 

RT "Protein kinaaa B kinases that mediate phosphatidyl inositol 3,4,5- 

RT trisphoophate-dependent activation of protein kinaae B . " ; 

RL Science 279:710-714(1998) . 

RN (4] 

RP NUCLEOTIDE SEQUENCE (LARGS SCALE MRNA) (ISOFORM 1). 

RC TISSUE-Kidney; 

RX MEDLINE-223B82S7; PubMed-12477932; DOI-10 . 1073/pnas. 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G. , 

RA Klausner R.D. , Collina F.S., Wagner L., Shenroen CM., Schuler G.D., 

RA Altachul S.F., Zeeberg B. , Buetow K.H., Schaefer CP., Bhat N.K., 

RA Hopkina R.F., Jordan H., Moore T., Max S.I., Wang J., Haieh P., 

RA Diatchenko L. , Ma ru sine K. , Parmer A. A. , Rubin G.M., Hong L. . 

RA Stapleton M. , Soarea M.B., Bona 1 do M.F., Casavant T.L., Scheetz T.B., 

RA Brownatein M.J., Uadin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G.J. , Ab rams on R.D., Mullahy S. J. , 

RA Boaak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richarda S., Worley K.C . Hale S., Garcia A.M., Gay L.J., Hulyk S.W. , 

RA Villalon D.K.. Muzny D.M., Sodergren B.J., Lu X., Gibbe R.A., 

RA Fahey J., Helton B. , Ketteman M. . Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C. , Shevchenko Y., Bouffard G.G. , 

RA Blakealey R.W. , Touchman J.W., Green B.D., Dickaon M.C, 

RA Rodriguez A.C, Grirowood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. . Krzywinaki M.I., Skalska U., Smailua D.B., 

RA SchnerchA., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences . * ; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN (5) 

RP MUTAGENESIS OF ARG-474, AND ALTERNATIVE SPLICING. 

RX MEDLINE-98301766; PubMed-9637919; DOI-10 . 1016/S0960 -9822 (98) 70274-X; 

RA Anderson K.B.. Coadwell W.J., Stephens L.R., Hawkins P.T.; 

RT ■Translocation of PDK-1 to the plasma membrane is important in 

RT allowing PDK-1 to activate protein kinase B."; 

RL Curr. Biol. 8:684-691(1998). 

RN (6) 

RP PHOSPHORYLATION SITES SER-25; SER-241; SER-393; SBR-396 AND SER-410. 

RP AND MUTAGENESIS OF SBR-2S; SER-241; SER-393; SER-396 AND SER-410. 

RX MEDLINE- 99386657; PubMed-104 55013; 001-10.1042/0264-6021:3420287; 

RA Caaamayor A., Morrice N.A. . Aleeai D.R.; 

RT 'Phosphorylation of Ser-241 ia essential for the activity of 3- 

RT phosphoinosi tide -dependent protein kinaae-1: identification of five 



eltea of phoaphorylation in vivo."; 
Biochem. J. 342:287-292(1999). 
17) 

MUTAGENESIS OF ALA-277. 

MEDLINE- 99292604; PubMed-10364160; 

Paradia S., Ail ion M. , Toker A., Thomas J.H., Ruvkun G. ; 
*A PDK1 homo log is necessary and sufficient to transduce AGE- 1 PI3 
kinaae signals that regulate diapauae in Caenorhabdi t is elegana."; 
Genea Dev. 13:1438-1452(1999). 
(81 

PHOSPHORYLATION SITES TYR-9; SER-241; TYR-373 AND TYR-376, AND 
MUTAGENESIS OP TYR-9; TYR-373 AND TYR-376. 

MEDLINE-21463095; PubMed-11481331 ; DOI-10 ■ 1074/ jbc.M105916200 ; 
Park J., Hill M.M., Heaa D. , Brazil D.P., Hofateenge J., 
H earnings B. A. ; 

"Identification of tyrosine phosphorylation sites on 3- 
phosphoinositide-dependent protein kinaae-1 (PDK1) and their role In 
regulating kinaae activity."; 
J. Biol. Chem. 276:37459-37471(2001). 
[9] 

PHOSPHORYLATION SITE SER-241. 
PubMed-15302935; DOI-10. 1073/pnaa. 0404720101; 

Beauaoleil S.A., Jedrychowski M. , Schwartz D. , Elias J.E., Villen J., 
Li J., Cohn M.A., Cantley L.C , Gygi S.P.; 

■ Large- scale characterization of HeLa cell nuclear phoaphoprotaine. " ; 
Proc. Natl. Acad. Sci. U.S.A. 101:12130-12135(2004). 

-1- FUNCTION: Phoaphorylatea and activates not only PKB/AKT, but also 
PKA, PKC-zeta, p70S6K and p90S6K/RSK. May play a general role in 
signaling proceasea and in development (By similarity) . Isoform 3 
is catalytically inactive. 

- 1 - CATALYTIC ACTIVITY: ATP + a protein - ADP ♦ a phosphoprotein. 

-1- SUBCELLULAR LOCATION: Cytoplasmic and membrane -aasociated after 
cell stimulation leading to its translocation. Tyrosine 
phosphorylation aeema to occur only at the plasma membrane. 

- I - ALTERNATIVE PRODUCTS: 

Event -Alternative splicing; Named isoforms-3; 
Name-1; 

Isold-01 5530-1; Sequence-Displayed; 
Name- 2; 

IaoId-015530-2; Sequence- VSP_0 04 894 ; 
Name- 3 ; 

IaoId-015530-3; Sequence -VSP_0 04 8 9S; 
-1- TISSUE SPECIFICITY : Appeera to be expressed ubiquitously. 
-1- PTM: Phosphorylated on tyrosine and serine/ threonine. 

Phosphorylation on Ser-241 in the activation loop is required for 

full activity. PDK1 itself can autophoaphorylate Ser-241, leading 

to ita own activation. 
- 1 - SIMILARITY: Belongs to the Ser/Thr protein kinaae family. PDK1 

subfamily. 
-1- SIMILARITY: Contains 1 PH domain. 



Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 
Distributed under the Creative Commons Attributlon-NoDerivs License 



EMBL; AF017995; AACS1825.1; -; mRNA. 
EMBL; Y15056; CAA75341.1; -; mRNA. 
EMBL; BC012103; AAH12103.1; -; mRNA. 



OR 


PDB; 


1H1H; 


X-ray; 


A-71-359. 


PT 


VARSPLIC 


1 


50 


DR 


PDB; 


10KY; 


X-ray; 


A-51-360. 


FT 








DR 


PDB; 


10KZ; 




A- 51 -360. 


FT 


VARSPLIC 


238 


263 


DR 


PDB; 


1UU3; 


X-ray; 


A-51-360. 


PT 








DR 


PDB; 


imr?; 


X-ray; 


A-51-360. 


FT 


MUTAGEN 


9 


9 


DR 


PDB; 


1UUB; 


X-ray; 


A-51-360. 


FT 








DR 


PDB; 


1UU9; 


X-ray; 


A-73-3S7. 


FT 




25 


25 


DR 


PDB; 


1UVR; 


X-ray; 


A-71-359. 


PT 


MUTAGEN 


241 


241 


DR 


PDB; 


1W1D; 


X-ray; 


A-409-556. 


FT 


MUTAGEN 


277 


277 


DR 


PDB; 


1W10; 


X-ray; 


A-409-S56. 










DR 


PDB; 


1W1H; 


X-ray; 


A/B/C/D-409-556. 


Query Match 




100. 


DR 


PDB; 


1Z5H; 


X-ray; 


A-74-359. 


Beat Local Similarity 


100. 


DR 


PDB; 


2BIY; 


X-ray; 


A-51-360. 


Katchea 556; 


Conservative 



Bnsembl; EN5G0000O14O992; Homo aapiena. 
H-InvDB; HIX0012732; -. 
KGNC; HGNC:B816; PDPK1 . 
MIM; 605213; gene. 
LinkHub; 015530; -. 
GO; GO: 0005737; C:cytoplaara; IBP. 
GO; GO:0005B86; C:plaama membrane; IBP. 

GO; GO:0004676; F:3-phoaphoinosi tide -dependent protein kinaae. . .; TAB. 

GO; GO: 0030036 ; P: act in cytoskeleton organization and biogenetic; TAS. 

GO; GO: 0008286; P:inaulin receptor aignaling pathway; TAS. 

GO; GO: 0006468; P: protein amino scid phosphorylation; TAS. 

InterPro; IPR001849; PH. 

InterPro; I PRO 11 993; PH_typo. 

InterPro; IPR000719; Prot_kinaae. 

InterPro; IPR0OB271; Ser_thr_pkin_AS . 

InterPro; IPR002290; Ser thrjpkinaae. 

InterPro; IPR001245; Tyrj?kinese. 

Pfam; PF00069; Pkinaae; 1. 

ProDora; PD000001; Prot_kinaaa; I. 

SMART; SM00220; S_TKc; 1. 

PROSITE; PS50003; PHDOKAIN; FALSE_NEG. 

PROSITE; PS00107; PROTEIN KINASE_ATP; 1. 

PROSITE; PS50011; PROTB I N~KI NASE~DOM ; 1. 

PROS ITS; PS00108; PROTBIN~KINASE~ST; I. 

3D-structure; Alternative splicing; ATP-binding; Kinaae; Membrane; 
Nucleotide -binding; Phosphorylation; Serine/ threonine-protein kinaae; 
Transferase. 



FT 


CHAIN 


1 


S56 


3 - phospho inos i t i de -dependent protein 


FT 








kinase 1. 


FT 








/FTId-PRO_00000865O0. 


FT 


DOMAIN 


82 


342 


Protein kinase. 


FT 


DOMAIN 


459 


550 


PH. 


FT 


NP_BIND 


88 


96 


ATP (By similarity) . 


FT 


COMPBIAS 


389 


398 


Poly-Ser. 


FT 


ACT_SITB 


205 


205 


Proton acceptor (By similarity) . 


FT 


BINDING 


111 


111 


ATP (By similarity) . 


FT 


MOD RES 


9 


9 


Phosphotyrosine . 


FT 


MOD RES 


25 


2S 


Phosphoserine. 


FT 


MOD RES 


241 


241 


Phosphoserine (by autocatalyeis) . 


FT 


MOD RES 


373 


373 


Phosphotyrosine . 


FT 


MOD~RES 


376 


376 


Phosphotyros ine . 


FT 


MOD RES 


3 93 


393 


Phosphoserine. 


FT 


MOD RES 


396 


396 


Phosphoserine. 


FT 


MOD RES 


410 


410 


Phoaphoserine. 



Missing (in isoform 2) . 
/FTId.VSP_004B94. 
Missing (in isoform 3) . 
/FTId-VSPJ)04B95. 

Y->F: Slight reduction in pervanada te- 
at inula ted tyrosine phosphorylation. 
S->A: No effect. 
S->A: No activation. 

A->V: 3-fold increaae in kinase activity. 



r; Score 2941; DB It 
t, Prod. No. 3.5e-184; 
0; Miamatchea 0; 



Length 556; 



Oy 


1 


Db 


1 


Oy 


61 


Db 


61 


Oy 


121 


Db 


121 


Oy 


161 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Oy 


361 


Ob 


361 


Oy 


421 


Db 


421 


Oy 


481 


Db 


481 


Oy 


541 


Db 


541 



llllllfllllllllllllllllllllllllllllllllllllllllllllltllltlll 



IIIIHItllllllltlllllllltllltlllllllllllllllllillltlllllllll 



Itllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



llllllllllllllllllllllllllllllllllllllllllltlllllllllllllll 



llllllllllllllllllllllllltllllfllllllllllllllltlllllllllllll 



llllllllllllllllllllllllllllllllllllllllllllllllillllllllltl 



YL PAMS EDDEDCYGNYDNLLSQFGCMQVSSSSSSHSLSASDTGLPQRSGSNI E0Y I HDLD 420 
lllltllllllllllllllllllllltllllllllllllllllllllllllllltlllll 



lllllllllllllllllllllllillllllllllltllllllllllllillllllillll 



llllllllllllllllllllllllllltlllllllllllllillllllllllllllllll 



lllllllll lllllll 



RESULT 2 
Q6FI20_HUMAN 



Schick M. , Eisenstein S.. 
Shen B., Henze S., Mar W., 



Q6PI20_HUMAN PRELIMINARY; PRT; 556 AA. 
Q6FI20; 

10-MAY-200S, integrated into UniProtKB/TrEMBL. 

10-MAY-2005, sequence version 1. 

21-FEB-2006, entry veraion 10. 

PDPK1 protein. 

Name-PDPKl; 

Homo sapiens (Human) . 

Bukaryota; Metazoa; Chordata; Crsniata; Vertebrate; Euteleostomi; 
Manmalia; Butheria; Euarchontoglireo; Primates; Catarrhini; Hominidae; 
Homo. 

NCBI_TaxID-9606; 
tl] 

NUCLEOTIDE SEQUENCE . 
Kalleck A., Ebert L. , Mkoundinya M. , 
Neubert P.. Kstrang K. , Schatten R. , 
(Corn B. , Zuo D. , Hu Y. , LaBaer J.; 

Submitted (JUN-2004) to the EMBL/ GenBank/ DDBJ databases. 

-!- CATALYTIC ACTIVITY: ATP ♦ a protein - ADP ♦ a phosphaprotein. 

-i- SIMILARITY: Belonga to the Ser/Thr protein kinase family. 

Copyrighted by the UniProt Consortium, see http://www.uniprot.org/ternw 
Distributed under the Creative Commons Attribution -NoDerivs License 

EMBL; CR536517; CAG38755.1; -; mRNA. 

SMR; Q6FI20; 72-358. 409-556. 

Eneembl; ENSG00000140992; Homo sapiens. 

GO; GO: 0005524; P:ATP binding; ISA. 

GO; GO: 0000166; F: nucleotide binding; IBA. 

GO; GO: 0004674; F: protein serine/ threonine kinase activity; IEA. 

GO; GO: 0016740; F ; transferase activity; IBA. 

GO; 00:0006468; P:protein amino acid phosphorylation; IEA. 

InterPro; IPR011993; PH_type. 

InterPro; IPR000719; Protjtineee. 

InterPro; IPR008271; Ser_thr_pkin_AS. 

InterPro; IPR002290; Ser_thr_pkinaae. 

InterPro; IPR001245; Tyr_pkinase. 

Pfam; PF00069; Pkinase; 1. 

SMART; SM00220; STKc; 1. 

PROSITE; PS00107; PROTEINJCINASB ATP; UNKN0WN_1 . 
PROSITB; PS50011; PROTB I N~KI NASEDOM ; 1. 
PROSITE; PS00108; PROTB I N~KI NASE~ST ; 1. 
ATP-binding; Kinase; Nucleotide-binding; 
Serine/threonine-protein kinase; Transferase. 
SEQUENCE 556 AA; 63152 MW; ED8C0306DC4DO653 CRC64; 



Query Match 100.0%; Score 2941; DB 2; 

Best Local Similarity 100.0%; Pred. No. 3.5e-184; 
Matches 556; Conservative 0; Mismatches 0; 



Length 556; 



1 MARTTSQLYDAVPIQSSWLCSCPSPSMVRTQTE5STPPOIPGO^ 60 

llllllllllllllllllllllllllllllilllllllllllllllllllllllllllll 
1 MARTTSQLYDAVPIO^SVVLCSCFSPSMVRTOTESCTPPGIPGG^ 60 

61 GAGSLQHAQPPPQPRiOOlPEDFIGTGKILGEGSFSTVVLARBLATSRBYAIKILBKRHI IK 120 

IIIIIIIIIIIIIIMI1 IMMIMIIIIIIIIIIIIIlll III III I lllllll MM 

61 GAGSLOHAQPPPQPRKKRPEDFKFGKI LGEGSFSTVVLARELATSREYAI KI LSKRH 1 1 K 120 



oy 


121 


Db 


121 


Oy 


181 


Db 


181 


Oy 


241 


Db 


241 


Oy 


301 


Db 


301 


Oy 


361 


Db 


361 


Oy 


421 


Db 


421 


Oy 


481 


Db 


481 


oy 


541 


Db 


541 



Mllllltlllllillllllllllllll Illllllllllllllllllllllllll 



iiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiitiiiiiiiiiiiiiiiiiiiii 



fimiiiimiiiiniiiii iiiiiiiiiitiiiiiiiiiiiiiiiiiiiii 



tlllllllllllllllllllllltllllllllllllllllllllllllllllllllilll 



mm 



iiiiiiiiiiiiiiiiiiiiiiiiiiiifiiiiiiiiiiiiiiiiiniii 



IIIIIIIIINIIIIllllliUIIIIIMillllllllillllllllllllllllllll 



IIIIIIIIIIIIIIIIIIMIIIIIIIIIItlHIIIIIIIIIIIIIIIIIIIIIIHII 



lllllllllllllll! 



RESULT 3 
QS3HJ9 HUMAN 



Q53HJ9_HUMAN PRELIMINARY; PRT; 556 AA. v 
QS3HJ9; 

24 -MAY- 2005, integrated into UniProtKB/TrEMBL. 
24 -MAY -2005, sequence version 1. 
07 -FEB -2006, entry version 6. 

3-phosphoinositide dependent protein kinase-1 variant (Fragment). 
Homo aapiena (Human) . 

Bukaryota; Metazoa; Chordata; Craniate; vertebrate; Euteleostomi; 
Mammalia; Butheria; Buarchontoglires; Primates; Catarrhini; Hominidae; 
Homo. 

NCBI TaxID-9606; 
tU " 

NUCLEOTIDE SEQUENCB. 
TISSUB-Coronary artery; 

MEDLINE-94171032; PubMed- 812 5298 ; 001-10.1016/0378-1119(94)90802-8; 
Maruyama K. , Sugano B. ; 

•01 igo- capping : a simple method to replace the cap structure of 
euearyotlc oRNAs with oligorlbonucleotides. •; 
Gene 138:171.174(1994). 
[2] 

NUCLEOTIDE SEQUENCB. 



RC TISSUE-Coronary artery; 

RJC MSDLIKE-9B0389a6; PuhMed-9373149; DOI - 10 . 101 6/S037B - 1 1 1 9 ( 97) 0041 1 -3 ; 

RA Suzuki Y-. Yoahitomo K. , Karuyama K. , Suyama A. , Sugano 8. ; 

RT 'Construction and characterization of a Cull length-enriched and a 5'- 

RT end-enriched cDNA library. ".- 

RL Oene 200:149-156(1997). 

RN (3] 

RP NUCLEOTIDE SEQUENCE. 

RC T I S5UB- Coronary artery; 

RA Suzuki Y. , Sugano 6., Totoki Y. , Toyoda A. , Takeda T.. Sakaki Y. , 

RA Tanaka A. , Yokoyana S. ,- 

RL Submitted (APR-3005) to the EMBL/GenBank/DDBJ databases. 

CC SIMILARITY : Belong, to the Ser/Thr protein kinaae family. 

CC 

CC Copyrighted by the UniProt Coneort iutn, aee http://www.uniprot.org/terma 

CC Distributed under the Creative Comraona Attribution-NoOeriva Licenae 

CC 

DR EKBL; AK322S81; BAD96301.1; -; mRNA. 

DR SMRj QS3HJ9; 72-358, 409-556. 

DR Ensembl; ENSG00000140992 ; Homo eapiene. 

DR GO; 00:0005524; F : ATP binding; ISA. 

DR OO; 00:0000166; F: nucleotide binding; IEA. 

DR OO; 00:0004674; P: protein eerine/ threonine kinaae activity; ISA. 

DR GO; 00:0016740; P ; tranaf erase activity; IEA. 

DR GO; GO: 0006468; P: protein amino acid phosphorylation; IEA. 

DR InterPro; I PRO 11993; PH typo. 

DR InterPro; IPR000719; Prot_kinaee. 

DR InterPro; IPR008271; Sor_thr_pkin_AS . 

DR InterPro; IPR002290; Ser thr_pkinaae. 

OR InterPro; IPR00124S; Tyrjkinase. 

DR Pfam; PF00069; Pkinaoe; 1. 

OR ProDom; PD000001; Prot_kinaae; 1. 

DR SMART; SM00220; S_TKc; 1. 

OR PROSITE; PS00107; PROTEIN.KINASE_ATP; 1. 

OR PROSITE; PS50011; PROTE I N~K I NASE_DOM ; 1. 

DR PROSITE; PS00108; PROTE I N~KI NASB~ST ; 1. 

KW ATP-binding; Kinaae; Nucleotide -binding; 

KW Serine/ threonine -protein kinaae; Tranaf erase. 

FT N0N_TER 1 i 

SQ SEQUENCE 556 AA; 63126 MM; BEED4F6E7C7D958C CRC64; 

Query Match 99.51; Score 2926; DB 2; Length SS6; 
Beat Local Similarity 99.5%; Pred. No. 3.4e-183; 

Matches 553; Conservative 0; Mi oma tehee 3; Indole 0; Gapa 0; 

Oy 1 MARTTSQL YDAVP I QS SWLCS CPS PSMVRTQTE SSTP PG I PGG SRQGPAMDGTAAE P RP 60 

I II! Mil II! IMMII III llll III III I III ill III I 111 IIMII I III Ml M 

Db 1 MARTTSQL YDAVP I QS SWLCS CPS PSMVRTQTBSSTP PG I PGGSRQGPAMDGTAAE P RP 60 

Qy 61 GAGSLOHAOPPPQPRiOOlPEOFKFGKILGEGSFSTWLARELATSREYAIKILBKRHIIK 120 

lilllllllllltllllllllllllllllllllSlflllllllllllllllllillllll 

Db 61 GAGSLQHAQPPPQPRKKRPEDFKFGKILGEGSFSTVVXARELATSREYAIKILEKRHI IK 120 



Qy 

Db 



121 ENKVPYVTRERBVMSRLDHPFFVKLYPTFQDDEKLYFGLSYAKNGELLKYIRKIGSFDET 180 

lllllllllllllllillllllltllllllllllllllllllllllllllllllllllfl 
121 ENKVP YVTRERDVMS RLDH PF FVKL Y FT FQDOE KL YFG LS YAKNGBLLKY 1RKIGSFDBT 180 



Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


oy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


461 


Db 


4B1 


oy 


541 


Db 


541 
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iiiiiiiiMiimiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiii nun 



in iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiitiiiiiiiiiniiiiiiii 



I II IMMII I IIIIIIIIIIIIMII lllllll HUM llllllll III III III III 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIlllllllllllllllllilllllll 



llllllllllllllll 



RESULT 4 
PDPK1 RAT 
ID PDPK1_RAT 
AC OS5173; 
DT 18-OCT-2001 
DT 01-JUN-1998 
DT 07-MAR-2006 



STANDARD; 



PRT; 559 AA. 



integrated into UniProt KB/Swiae-Prot. 
sequence veraion 1. 
entry veraion 42 . 

DB 3 -phoaphoinoai tide -dependent protein kinaae 1 (EC 2.7.1.37) {Protein 

DE kinaae B kinase) (PkB kinaae) . 

ON Name-Pdpkl; Synonyms *Pdkl; 

OS Rattua norvegicua (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniate; Vertebrate; Buteleoatomi; 

OC Manmalia; Eutheria; Euarchontoglirea; Glirea; Rodentia; Sciurognathi; 

OC Muroidoa; Muridae; Murinae; Rattua. 

OX NCBI_TaxID-10116; 

RN [1] 

RP NUCLEOTIDE SEQUENCE {MRNA) . 

RC TISSUE-Brain; 

RX MBDLZNB-9aill410; PubMed-9445477; DOI-10 . 1 126/ science . 27 9 . S3 51 . 710 ; 

RA Stephena L.R., Anderson K.B., Stokoe D., Brdjument-Bromage K. , 

RA Painter G.P., Holmes A.B., Gaffney P.R.J. , Reese C.B., McCormick F. , 

RA Tempat P., Coadwell W.J. , Hawkins P.T.; 

RT "Protein kinase B kinases that mediate phosphatidylinoaitol 3,4,5- 

RT triaphoaphate-dependent activation of protein kinaae B."; 

RL Science 279:710-714(1998) . 

CC -I- FUNCTION: Phosphorylatea and activates not only PKB/AKT, but also 
CC PKA, PKC-zete, p70S6K and p90S6K/RSK. May play a general role in 

CC signaling processes and in development (By similarity) . 



-I- CATALYTIC ACTIVITY: ATP ♦ a protein - ADP + a phoephoprotein . 

-t- SUBCELLULAR LOCATION: Cytoplasmic and membrane -associated after 
cell stimulation leading to its translocation. Tyrosine 
phosphorylation seems to occur only at the plasma membrane (By 
aimilarity) . 

- I - PTM: Phosphorylated on tyrosine and serine/ threonine. 

Phosphorylation on Ser-244 in the activation loop is required for 

full activity. PDK1 itself can autophosphorylate Ser-244, leading 

to ita own activation (By similarity) . 
-!- SIMILARITY: Belonge to the Ser/Thr protein kinase family. PDK1 

subfamily. 
-!- SIMILARITY: Containa 1 PH domain. 

Copyrighted by the UniProt Consortium, see http://www.uniprot.0rg/1 
Distributed under the Creative Common a Attribution-NoOeriva License 



EMBL; Y15748; CAA75758.1; -; mRNA. 

HSSP; 015530; 1H1W. 

SKR; 055173; 75-361, 412-559. 

Bnsembl; ENSRNOG00000006136; Rattus norvegicus. 
RGD; 620307; Pdpkl . 
InterPro; IPR001849; PH. 
InterPro; IPR011993; PH_type. 
InterPro; IPR000719; Prot_kinase. 
InterPro; IPR008271; Ser_thr_pkin_AS. 
InterPro; IPR002290; Ser_thrj>kinaae. 
InterPro; I PRO 01 24 5; Tyrjjkinaee. 
Pfam; PFOO069; Pkinase; 1. 
ProDom; PD000001; Prot_kinase; 1. 
SMART; SM00220; S_TKc; 1. 
PROSITE; PSS0003; PH_DOMAIN; FALSB_NEG. 
PROSITE; PS00107; P ROTE I N_KI NASE_ATP ; 1. 
PROSITE; PS50011; P ROTE I N_KI NAS E_ DOM ; 1. 
PROSITE; PS00108; PROTEIN_KINASB_ST; 1. 
ATP-binding; Kinase; Membrane; Nucleotide-binding; 
Serine/ threonine -protein kinase; Tranaf erase. 



Phosphorylation; 



PT 


CHAIN 


1 


559 


3 -phosphoinositide -dependent protein 


FT 
FT 








kinaae 1. 

/FTId-PRO_0000086502 . 


FT 


DOMAIN 


85 


345 


Protein kinaae. 


PT 


DOMAIN 


462 


553 


PH. 


PT 


NP_BIND 


91 


99 


ATP (By aimilarity) . 


FT 


COMPBIAS 


392 


399 


Poly-Ser. 


PT 


ACT_SITB 


200 


208 


Proton acceptor (By similarity). 


FT 


BINDING 


114 


114 


ATP (By aimilarity) . 


PT 


MOD RES 


9 


9 


Phosphotyrosine (By similarity) . 


FT 


MOO RES 


25 


25 


Phoaphoaerine (By similarity) . 


FT 


KOO_RES 


244 


244 


Phoephoserine (by autocatalysis) (By 


FT 








aimilarity) . 


PT 


MOD_RES 


376 


376 


Phoaphotyroeine (By similarity) . 


FT 


MOD~RBS 


379 


379 


Phosphotyrosine (By similarity) . 


FT 


KOO~RES 


396 


396 


Phoaphoaerine (By aimilarity) . 


FT 


MOD~RES 


399 


399 


Phoaphoaerine (By aimilarity) . 


PT 


MOD_RES 


406 


406 


Phoaphoaerine (By similarity) . 


FT 


mod jibs 


413 


413 


Phoaphoaerine (By aimilarity) . 


SQ 


SEQUENCE 


5S9 AA; 


63609 MW 


ADB70A7F6C2A20BP CRC64; 



Query Match 94.8%; Score 2787.5; DB 1; 

Beat Local Similarity 95.2%; Pred. No. 4.1e-174; 
Matches 532; Conaervative 6; Mismatches 18; 



Qy 


1 


Db 


1 


Qy 


60 


Db 


61 


oy 


118 


Db 


121 


Qy 


178 


Db 


181 


oy 


238 


Db 


241 


Qy 


298 


Db 


301 


oy 


358 


Db 


361 


oy 


418 


Db 


421 


Qy 


478 


Db 


481 


oy 


538 


Db 


541 
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iiiiiimiiiiiimmiiniiiiiiiiiiiiiiiiiiiiihiiiiiiiiiii 
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Ml MIIMM II I IIMII I Mill 



IIIIMIMIIIIIIIMIIIIIIIIIIMII 

,ILKMOPVDKRKGLFARRRQLL 



IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIHI 



MIMMIMM IIMII 



417 
420 



RESULT S 
PDPKl_MOUSE 

ID PDPKl_KOUSS STANDARD; PRT; 5S9 AA. 

AC Q9Z2A0; 09R1D8; Q9R21S; 

DT 18-OCT-2001, integrated into Uni ProtKB/Swi as -Prot. 
DT 18-OCT-2001, sequence veraion 2. 
DT 07-MAR-2006, entry version 44. 

DB 3 -phoaphoinoai tide -dependent protein kinase 1 (EC 2.7.1.37) (mPOKl) . 
OH Name-Pdpkl; Synonyms - Pdkl ; 
OS Mua muaculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniate; Vertebrate; Buteleoatomi; 

alia; Eutheria; Euarchontoglirea; Glirea; Rodentia; Sciurognathi; 



Muroidea; Muridae; Murinae; Hum. 
NCBI TaxID-10090; 
(1) " 

NUCLEOTIDE SEQUENCB (KRNA) . 
TISSUE-Livar; 

MEDLINB-99175193; PubMed- 10075713; 001-10. 1074/ jbc. 274 . 12. 8117; 
Dong L.Q.. Zhang R.-B.. Lenglaie P.. Ha H. , Clark H. , Zhu L. . Liu P.; 
"Primary •tructura, tieeue distribution, and expreeeion of mouse 
phoephoinoeitide-dependent protain kinase- I, a protain kinase that 
phoephorylatee and activataa protain kinaae C lata."; 
J. Biol. Chem. 274:8117-8122(1999). 
(2) 

NUCLEOTIDE SEQUENCE (KRNA) . 

TISSUB-Brein; 

Park J., Hemmings B.A.; 

"Mouse phoaphoinoaitlda-depandont protein kinaaa 1 (mPDKl)."; 
Submitted (FEB- 1999) to the BKBL/GenBank/DDBJ databaaea. 
[3] 

NUCLEOTIDE SEQUENCE [KRNA] . 

STRAIN-C57BL/6; 

Xu P. , Taylor S. ; 

Submitted (JUL-1998) to the EKBL/OenBank/DDBJ databaaea. 

•!- FUNCTION: Phoaphorylatea and activates not only PKB/AKT, but alao 

PKA, PKC-zeta. p70S6K and p90S6K/RSK. Kay play a general role in 

aignaling proceaaes and in development. Could alao play a role in 

aax differentiation proceaaea. 
-I- CATALYTIC ACTIVITY: ATP ♦ a protein - ADP ♦ a phoaphoprotein. 
-!- SUBCELLULAR LOCATION: Cytoplaemic and membrene-eesociated after 

cell atiraulation leading to ita trana location. Tyroaino 

phosphorylation aeema to occur only at the plasma membrane. 
-1- TISSUE SPECIFICITY: Highly expressed in heart, brain, liver and 

testis, also expreaaad in embryonic cells. 
-!- PTM: Phosphorylated on tyrosine and serine/threonine. 

Phosphorylation on Ser-244 in the activation loop is required for 

full activity. PDK1 itself can autophoaphorylate Ser-244, leading 

to its own activation (By similarity) . 
-J- SIMILARITY: Belonga to the Ser/Thr protein kinase family. PDK1 

subfamily. 

SIMILARITY: Contains 1 PH domain. 



Copyrighted by the UniProt Conaortiuro, aee http://www.uniprot.org/terma 
Distributed under the Creative Commona Attribution-NoDerive License 

EKBL; AF086625; AAC67544.1; •; mRNA. 
EKBL; AF126294; AAD38S0S.1; -; mRNA. 
EKBL; AF079S35; AAC9611S.1; mRNA. 
HSSP; 015530; 1H1H. 
SKR; Q9Z2A0; 75-361, 412-5S9. 
IntAct; Q9Z2A0; 

Bnsembl; ENSKUSG00000024122; Hue musculus. 

KOI; KGI:1338068; Pdpkl. 

GO; GO:0005737; C:cytoplaam; IDA. 

00; GO: 00 16023; C: cytoplaemic membrane -bound veaicla; IDA. 

GO; 00:0004676; F:3-phosphoinos it ide -dependent protein kinase. . .; IDA. 

GO; GO:0006972; P: hyperosmotic response; IDA. 

GO; 00:0007165; P:signal transduction; TAS. 

InterPro; IPR001849; PH. 



InterPro; IPR011993; PH_type. 
InterPro; IPR000719; Prot_kinaae. 
InterPro; I PRO 08371; Ser_thr _pkin_AS. 
InterPro; IPR002290; Ser~thr_pkinaee. 
InterPro; IPR001245; Tyrjpkinaae. 
Pfam; PP00069; Pkinase; 1. 
ProDora; PD000001; Prot kinase; 1. 
SMART; SM00220; S TKc; 1. 
PROS ITS; PS50003;~PH_DOMAIN; FALSB_NEO . 
PROSITB; PS00107; P ROTE I N_KI NASB_ATP ; 1. 
PROSITE; PSS0011; PROTBIN^KINASEJMM; 1. 
PROSITB; PS00108; PR0TE I N_KI NASE_ST ; 1. 

ATP-binding; Kinase; Membrane; Nucleotide -binding; Phosphorylation; 
Serine/threonine-protein kinaae; Transferaae. 



FT 
FT 


CHAIN 




559 


3 -phoophoinofl 1 1 id© - dependent proLfein 
kinaae 1. 


FT 








/FTId-PRO_0000086501 . 


FT 


DOMAIN 


85 


345 


Protain kinase. 


FT 


DOMAIN 


462 


553 


PH. 


FT 


NP BIND 


91 


99 


ATP (By similarity) . 


FT 


COMPBIAS 


392 


401 


Poly-Ser. 


FT 


ACTSITE 


308 


206 


Proton acceptor (By similarity) . 


FT 


BINDING 


114 


114 


ATP (By similarity) . 


FT 


MOD_RBS 


9 


9 


Phosphotyrosine (By similarity) . 


FT 


K0O RES 


25 


25 


Phosphoserine (By similarity) . 


FT 


kod'rss 


244 


244 


Phosphoserine (by autocatalysia) (By 


FT 








eimilarity) . 


FT 


HOD RSS 


376 


376 


Phosphotyrosine (By similarity) . 


FT 


M0D~RBS 


379 


379 


Phosphotyrosine (By similarity) . 


FT 


MOD RES 


396 


396 


Phosphoserine (By eimilarity) . 


FT 


MOD RES 


399 


399 


Phoaphoaerine (By similarity) . 


FT 


MOD RES 


406 


406 


Phosphoaerine (By eimilarity) . 


FT 


MOD_RES 


413 


413 


Phosphoserine (By similarity) . 


FT 


CONFLICT 


84 


84 


D -> N (in Ref . 1) . 


FT 


CONFLICT 


248 


248 


T -> P (in Ref. 3) . 


FT 


CONFLICT 


285 


285 


F -> S (in Ref. 3) . 


FT 


CONFLICT 


546 


546 


W - > R (in Ref. 3). 


50 


SEQUENCE 


559 AA 


; 63759 


KW; F2A617A27460FAC9 CRC64; 



Query Match 94.2%; 
Best Local Similarity 94.6%; 
Katches 529; Conservative 



Score 2769.5; 0B 1; 
Pred. No. 6.2e-173; 
I; Mismatches 19; 



Length 559; 



Qy 1 KARTTSQL YDA VP I QSSWLCSCPS PSMVRTQTBSSTPPO I PGG - S RQG PAKDGTAAE PR 59 

imiiii iiiiiiiiiiiiiiiiiiiiiihii! : mi i mi mi n i 

Db 1 MARTTSQLYDAVP I QSSWLCSCPS PSMVRSQTEPGSS PG I P SGVSRQGSTKDGTTAEAR 60 

Qy 60 PGAGSLQH - -AQPPPQPRKKRPEDFKFGKILGEGSFSTVVLARELATSREYAI KILEKRH 117 

61 PSTNPLOWPAQLPPQP^ 120 

Qy 118 IIKENKVPYVTRERDVKSRI^HPFFVXLYFTFOTDEKLYFGLSYAKNGELLKYIRKIGSF 177 

Hill I Illllllllllllinilll 1IIIM IIIMIII 

Db 121 I I KENKVPYVTRBRDVKSRLDHPFFVKLYPTFQDDEKLYFGLSYAKNGELLKYI RKIGSF 180 

Qy 178 DBTCTRFYTAEI VSALEYLHGKGI IHRDLKPENILLNEDMHIQITDFGTAKVLSPBSKQA 237 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



0y 
Db 

Qy 



1B1 DETCTRFYTABIVSALBYLHGKGI IHRDLKPENILLNEDMHIQITDFGTAKVLSPESKQA 240 

238 RANSFVGTAQYVSPELLTBKSACKSSDLHALGCI I YQLVAGLPPFRAGNEYLI FQKI I KL 297 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

241 RANSFVGTAQYVSPELLTEKSACKSSDLWALGCI I YQLVAGLPPFRAGNEYLI FQKI I KL 300 

298 EYD F PE KFPP KARD LVEKLLVLDATKRLGCEEMEGYG PLKAH P F FE SVTWENLHQQTP PK 357 

II lilllllllllllllillllllllllllllllllllllllllh:|lll Ill 

301 EYH FPE KFF P KARD L VE KLLVLDATKRLGCEEKEGYG PLKAH P PFET I TWSNLHQQTP PK 360 

358 LTAYLPAKSB0DEDCYGNYDNL1£QFGCMQVSSSSSSHSL£ASDTGLPQRSGSNIBQYXH 417 

MM HI III I Hi III III I III II I IIIMIII Mill I MM II I III Mil 

361 LTA YLPAKS EDDEDCYGNYDNLLSOFG FMQVS S SSS SHS LSTVETSLPQRSGSN I BQY I H 420 
418 D LDSNS FBLDLQFSEDE KRLLLB KOAGGNPHHQFVENNL I LKMG PVDKRKGLFARRRQLL 477 

III IIIIIIIIIIMIIIIIIIIIIMIIIIIIU lllllll III III I III III III I 

421 D LDTNS FBLDLQFSEDE KRLLLB KQAGGNPHHQPVBNNL I L KKG PVDKR KGLFARRRQLL 480 
4 78 L TEG PHLYYVD PVNKVLKGE I PHSQELRPEAKNFKT F FVHT PNRTYYLKD PSGNAH KWCR 537 

i iii Minn illinium i nun iii iii i iii iii iii mi i iii iii iii i 

481 LTEG PHLYYVD PVNKVLKGE I PWSQELRPEAKNFKTFFVHTPNRTYYLKDPSGNAHKMCR 540 

538 KI QEVWRQR YQSH PDAAVQ 556 

lllllllhllhllllll 
541 KI QEVWRQQYQSNPDAAVQ 559 



RESULT 6 
Q3UHZ0J4OUSB 

ID Q3UHZ0_MOUSB PRELIMINARY; PRT; 559 AA. 

AC Q3UHZ0 ; 

DT ll-OCT-2005. integrated into UniProt KB/ TrBKBL. 

DT ll-OCT-2005, sequence version 1. 

DT 21-FEB-2006, entry version 8. 

DB CDNA, RIKEN full-length enriched library, clone: I9201BBK11 product:3- 

DB phoaphoinositide dependent protein kinase- l, full insert sequence. 

GN Name- Pdpkl ; 

OS Mus musculus (Mouse) . 

OC Bukaryota; Metazoa; Chordata; Craniate; Vertebrate; Butaleostomi; 

OC Mammalia; Butheria; Buarchontoglires; Gliree; Rodentie; Sciurognathi; 

OC Kuroidea; Kuridao; Kurinae; Hum . 

OX NCBIJTaxID- 10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RX KSDLINB-992792S3; PubMed- 1034 9636 ; DOI-10 . 1016/S0076-6B79 (99) 03004 -9; 

RA Caminci P., Hayaehizaki Y. ; 

RT -High-efficiency full-length cDNA cloning."; 

RL Methods Bnzymol. 303:19-44(1999). 

RN [21 

RP NUCLEOTIDE SEQUENCE. 

RX PubMod-16141072; DOI-10. 1126/ecienco. 1112014; 

RA Caminci P., Kaaukawa T. , Katayama S., Cough J. , Frith M.C. , Haeda N. , 

RA Oyama R. , Ravasi T. , Lanhard B. , Wells C, Kodzius R. , Shimokawa K. , 

RA Bajic V.B., Brenner S.B., Batelov S., Forreet A.R.. Zavolan M. , 

RA Davie M.J., Wllming L.O., Aidinis V., Allen J.B.. 

RA Ambeei-Implombato A., Apweller R . , Aturaliya R.H., Bailey T.L., 

L M. , Baxter L. , Beieel K.W., Beraano T. , Bono H., Chalk A.M., 



RA Chiu K.P.. Choudhary V., Chriatoffela A.. Clutterbuck D.R., 

RA Crowe M.L., Dalle B. , Dalrymple B.P., de Bono B., Delia Gatta G.. 

RA di Bernardo 0., Down T., Engstrom P., Fagiolinl M. , Faulkner G., 

RA Fletcher CP. , Fukuahima T. , Furuno M. , Futaki S., Gariboldi H. , 

RA Georgii -Hemming P., Gingeraa T.R., Gojobori T. , Green R.B., 

RA Guatincich S., Harbere M. , Hayaahi Y., Hensch T.K., Hirokawa N. , 

RA Hill D., Huminiecki L., Iacono N. , Ikeo K. , Iwama A., Iehikawa T. , 

RA Jakt M. , Kanapin A., Katoh M . , Kawaaawa Y. , Kelso J., Kitamura H. , 

RA Kitano H. , Kolliaa O. , Kriahnan S.P., Kruger A., Kummerfeld S.K., 

RA Kurochkin I.V., Lareau L.F., Lazarevic D. , Lipovich L. . Liu J., 

RA Liuni S.. McWilliam Kadan Babu M. , Madera M. , Marchionni L. , 

RA Matauda H., Matsuzawa S., Miki H., Mignone F. , Miyake S., Morris K. . 

RA Mottagui-Tabar S., Mulder N. , Nakano N. , Nakauchi H., Ng P., 

RA Niloeon R., Nishiguchi S-, Nishikawa S., Nori P., Ohara O. , 

RA Okazaki Y., Orlando V., Pang K.C. , Pavan H.J. , Pavesi G., Pesole G., 

RA Petrovaky N. , Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M. , 

RA Roet B., Ruan Y. , Salzberg S.L., Sandelin A., Schneider C. , 

RA Schonbach C. , Sekiguchi K. , Sample C.A., Seno S., Sessa L. , Sheng Y. , 

RA Shibata Y. , Shimada H., Shimada K. , Silva D . , Sinclair B. , 

RA Sperling S., Stupka B . , Sugiura K. , Sultana R. , Takenaka Y., Taki K. , 

RA Tammoja K. , Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A. 

RA Ueda H.R., van Nimwegen B., Verardo R., Wei C.L. , Yagi K. , 

RA Yamanishi H., Zabaroveky B . , Zhu S., Zimraer A., Hide W., Bult C, 

RA Grimmond S.M., Teaedale R.D., Liu B.T. , Brusic V., Quaekenbueh J., 

RA Hahlestedt C. , Mattick J.S., Hume D.A. , Kai C. , Sasaki 0.. Tomaru Y. . 

RA Fukuda S. , Kanamori -Katayama M. , Suzuki M. , Aoki J., Arakawa T. , 

RA Iida J., Imamura K. , Itoh M. , Kato T. , Kawaji H. , Kawagashira N. , 

RA Kawashima T., Kojima M. , Kondo S., Konno H-. Nakano K. , Ninomiya N. , 

RA Niahio T., Okada M. . Pleesy C. , Shibata K. , Shiraki T. , Suzuki S., 

RA Tagami M-, Waki K. , Watahiki A.. Okamura-Oho Y. , Suzuki H . , Kawai J., 

RA Hayashizaki Y.; 

RT 'The transcriptional landscape of the mammalian genome.*; 

RL Science 309:1559-1563(2005). 

RN [31 

RP NUCLEOTIDE SEQUENCE. 

RX PubMed- 1614 1073; DOI-10 . 1126/science. 1112009; 

RG RIKEN Genome Exploretion Research Group, and Genome Science Group 

RG (Genome Network Core Team) and the FANTOM Consortium; 

RT "Antisense Transcription in the Mammalian Tranacriptome . ■ ; 

RL Science 309:1564-1566(2005). 

RN (4J 

RP NUCLEOTIDE SEQUENCE. 

RX MEDLINE- 223 54683; PubMed- 12466851 ; DOI-10 . 103 8/natureO 1266; 

RA Okazaki Y. , Furuno M. , Kasukawa T. , Adachi J., Bono H. , Kondo S. , 

RA Nikaido I.. Osato N . , Saito R., Suzuki H-, Yamanaka I., Kiyosawa H.. 

RA Yagi K. , Tomaru Y., Haaagawa Y. , Nogami A., Schonbach C, Gojobori T., 

RA Baldarelli R . , Hill D.P., Bult C. . Hume D.A., Quackanbush J., 

RA Schrlml L.M., Kanapin A., Matsuda H. , Batalov S., Beieel K.H. , 

RA Blake J. A., Bradt D. , Brusic V., Chothia C, Corbani L.B., Cousins S., 

RA Dalle B. , Dragani T.A., Fletcher CP., Forrest A., Frazer K.6., 

RA Gaaaterland T. , Gariboldi M. , Giaai C. , Godzik A., Gough J., 

RA Grimmond S., Gustincich S. , Hirokawa N., Jackson I. J., Jarvis B.D., 

RA Kanai A., Kawaji H. , Kawaaawa Y., Kedzlerekl R.K. , King B.L., 

RA Konagaya A., Kurochkin I.V. , Lee Y., Lenhard B . , Lyons P. A., 

RA Maglott D.R., Maltaia L. , Marchionni L. , KcKenzie L. , Miki H. , 

RA Nagaahlma T. , Numata K. , Okido T., Pavan W.J. , Pertea O. , Peeole a., 

RA Petroveky N. , Pillei R . , Pontius J.U., Ql D. . Ramachandran S. , 



Ravaai T., Reed J.C., Reed D.J., Reid J . , Ring B.2., Ringwald M. , 
Sandelin A.. Schneider C. . Semple C.A. , Setou M. . Shimada K. , 
Sultana R. , Takenaka Y. , Taylor M.S. , Teaadale R.D., Tomita M. , 
Verardo R. , Wagner L. , Wahleatedc C. . Hang Y. , Watanabe Y., Hello c. , 
Wilming L.O. , Wynehaw-Boria A. , Yanagisawa M. , Yang I., Yang L., 
Yuan Z., Zavolan M. , Zhu Y., Zimmer A., Caminei P., Hayateu K. , 
Hirozene-Kiehikawa T. . Konno H.. Nakamura M. , Sakazume N. , Sato K. , 
Shiraki T.. Waki K. , Kawai J. , Aizawa K. , Arakawa T., Fukuda S.. 
Kara A., Hashizume W. , Imoteni K. , Ishii Y., Itoh M. , Kagawa I., 
Miyazaki A., Sakai K. , Saaaki □ . , Shibata K. , Shinagawa A., 
Yasunishi A., Yoahino H- , Waterston R . , Lander E.S., Rogere J., 
Birney B . , Hayaehizaki Y. ; 

"Analysis of the taouae trans crip tome baaed on functional annotation of 
60,770 full-length cDNAs."; 
Nature 420:563-573(2002). 
tS] 

NUCLEOTIDE SEQUENCE. 

MEDLINE-2 1065660; PubMed-11217851 ; DOI-10 . 1038/3505S500 ; 
Kawai J.. Shinagawa A., Shibata K. , Yoahino M., Itoh M. , Iahii Y. , 
Arakawa T.. Hara A.. Pukuniahi Y., Konno H., Adachi J., Fukuda S., 
Aizawa K. , Izawa M. , Nishi K. , Kiyosawa H., Kondo S. , Yamanaka I., 
Saito T. , Okazaki Y. , Gojobori T. , Bono H-, Kaaukawa T., Saito R. , 
Kadota K. , Mateuda H.A., Aehbumer H. , Batalov S., Caaavant T., 
Fleiochmann W. , Gaaaterland T. , Gieai C. , King B . , Kochiwa H.< 
Kuehl P., Lewie S. . Matauo Y., Nikaido I., Peaole G . , Quackenbuah J. , 
Schriral L.M., Staubli P. , Suzuki R. , Tomita M. , Wagner L. , Waahio T. , 
Sakai K. , Okido T. , Puruno M., Aono H, , Baldarelli R., Barah G. , 
Blake J.. Boffelli D. , Bojunga N. , Caminei P.. de Bonaldo M.P., 
Brownetein H.J., Bult C. , Fletcher C. , Fujita M. . Gariboldi M. , 
Guatincich S. ( Hill 0. , Hofmann M.. Hume D.A. , Kamiya M. , Lee N.H. , 
Lyona P.. Marchionni L., Mashima J., Mazzarelli J., Mombaerte P.. 
Nordone P., Ring 8.. Ringwald M. , Rodriguez I., Sakamoto N., 
Saaaki H., Sato K. . Schoenbach C. , Seya T-. Shibata Y., Storch K.-P., 
Suzuki H. , Toyo-oka K. , Wang K.H., Weitz C, Whittaker C. , Wilming L. , 
Wynahaw-Borie A., Yoohida K. , Haaegawa Y. , Kawaji H. , Kohtauki S., 
Hayaahizaki Y. ; 

•Functional annotation of a full-length mouse cDNA collection."; 

Nature 409:685-690(2001). 

16) 

NUCLEOTIDE SEQUENCE. 

HEDLINE-20499374; PubMed-11042159; DOI-10 . 1101/gr. 145100 ; 

Carninci P., Shibata Y . , Hayatsu N. , Sugahara Y., Shibata K. , Itoh M., 

Konno H., Okazaki Y., Muramatau M . , Hayaehizaki Y.; 

■Normalization and subtraction of cap- trapper-selected cDNAa to 

prepare full-length cDNA libraries for rapid diacovery of new genee."; 

Genome Res. 10:1617-1630(2000). 

(7] 

NUCLEOTIDE SEQUENCE. 

MEDLINB-20S30913; PubMed- 11076861; DOI-10 . 1101/gr . 152600; 
Shibata K. , Itoh M. ( Aizawa K. , Nagaoka S. ( Saaaki N. , Carninci P., 
Konno H. , Akiyama J., Niehi K. , Kitsunai T. , Tashiro H . , Itoh M., 
Sumi N., Iahii Y., Nakamura S. , Hazama M. , Niehine T., Harada A., 
Yamamoto R. , Matsumoto H., Sakaguchi S., Ikegami T.. Kaahiwagi K. , 
Fujiwake S. , Inoue K. , Togawa Y. , Izawa M. , Ohara E. , Watahiki M. , 
Yoneda Y., lahikawa T. , Ozawa K., Tanaka T. , Matsuura S., Kawai J., 
Okazaki Y. , Muramatau M. , Inoue Y. , Kira A., Hayaahizaki Y . ; 
•RIKEN integrated sequence analysis (RISA) system- 3 84 -format 



RT sequencing pipeline with 384 multicapillary sequencer."; 

RL Genome Res. 10:1757-1771(2000). 

RN (8) 

RP NUCLEOTIDE SEQUENCE. 

RA Arakawa T. . Carninci P. , Fukuda S. , Hashizume W. , Hayaahida K. , 

RA Hori P., lida J. . Xmamura K. , Imotani K. , Itoh M. , Kanagawa S., 

RA Kawai J., Kojima M. , Konno H. , Murata M. , Nakamura M. , Ninomiya N. . 

RA Niahiyori H., Nomura K. . Ohno H. , Sakazume N. a Sano H. , Saaaki D . , 

RA Shibata K. , Shiraki T. , Tagami M., Tagami Y.. Waki K. . Watahiki A., 

RA Muramatsu H. , Hayaahizaki Y. ; 

RL Submitted {MAR -20 04) to the EMBL/GenBank/DDBJ databases. 

CC -!- CATALYTIC ACTIVITY: ATP ♦ a protein - ADP ♦ a phoephoprotein. 

CC -i- SIMILARITY: Belongs to the Ser/Thr protein kinase family. 

CC 

CC Copyrighted by the UniProt Conaortium, see http://www.uniprot.org/terTOS 

CC Distributed under the Creative Commons Attribution-NoDeriva License 

CC 

DR EMBL; AK147149; BAE27716.1; -; mRNA. 

DR MGI; MGI: 1338068; Pdpkl . 

DR GO; GO: 0005737; C: cytoplasm; IDA. 

DR GO; 00:0016023; C: cytoplasmic membrane -bound vesicle; IDA. 

DR GO; 00:0004676; P : 3 -phosphoinoei tide -dependent protein kinase. . .; IDA. 

DR GO; CO; 0006972; P : hyperosmotic response; IDA. 

DR GO; GO: 0007 165; P: signal transduction; TAS. 

DR IntorPro; IPR000719; Proteinase. 

DR InterPro; IPR008271; Ser_thr_pkin_AS. 

DR InterPro; IPR002290; Ser_thr _pkinase. 

DR InterPro; IPR001245; Tyr_pkinaee. 

DR Pfam; PF00069; Pkinase; 1. 

DR ProDom; PD000001; Protjtinase; 1. 

DR SMART; SHOO 220; S_TKc;~l. 

DR PROSITE; PS00107; PROTEIN KINASE_ATP; 1. 

DR PROSITE; PS50011; PROTBINJCINASE_DOM; 1. 

DR PROSITE; PS00108; PROTE I N_KI NASB_ST ; 1. 

KW ATP-binding; Kinase; Nucleotide-binding; 

KW Serine/threonine-protein kinase; Tranaferaae. 

SQ SEQUENCE 559 AA; 63729 MW; F0D6AED27161F6B8 CRC64; 

Query Match 94.0%; Score 2764.5; DB 2; Length 559; 

Beat Local Similarity 94.5%; Pred. No. 1.3e-172; 

Matchea S28; Conservative 8; Mismatches 20; Indels 3; Gaps 2; 

Qy 1 HARTTSQLYDAVPIQSSVVLCSCPSPSMVRTQTESSTPPGIPGG-SROGPAMDGTAAEPR 59 

1 I I t I I 1 1 1 I 1 I 1 I 1 I I I 1 1 I I t 1 I I I I I I ^ 1 1 I : Mil I I I II Mil II I 
Db 1 MARTTSQLYDAVP I QSSWLCSCPSPSMVRSQTE PGSS PG I PSGVSRQGSTKDGTTAEAR 60 

Qy 60 PGAGSLQH - -AQPPPQPRKKRPBDFKFGKILGEGSFSTVVLARSLATSREYAI KILEKRH 117 

i ii ii iiiiiiMiiiiiiiimiiifiiimiiiiiiiiiiiimiii 

Db 61 PSTNPLQQH PAQLP PQPRKKRPEDFKFGKI LQEGSFSTWLARELATSREYAI KI LEKRH 120 



Qy 118 IIKENKvTYVTRERDWSRLDHPFFVTaYFTFQDDBKLYFGLSYAKNGELLKYIRKIGSF 177 

lllilllllllllllllllllllilllllllllllllllllllHIIIIIIIIIIIIIII 
Db 121 IIKENKVPYVTRERDV^RLDHPFFVKLYFTFQDDEKLYFGLSYAKNGELLKYIRKIGSF 180 

Qy 178 DBTCTRFYTABIVSALBYLHGKGIIHPXL10?ENILLNEDMHIQITDFGTAKVLSPESKQA 237 

II llllilllllllllllllllllllllllllllillllllllllllllllllllllll 
Db 181 DEACTRFYTAE I VSALB YLHG KG I I HRD LKPBN I LLNBDMH I Q I TD FGTAKVLS PES KQA 240 



Db 241 RANSPVGTAQYVSPBLLTEKSACKSSDLWALGCI IYQLVAGLPPFRAGNEYLI FQKI I KL 300 

Qy 298 EYD P PBKF PP KARDLVEKLLVLDAT KRLGCE EMEG YG P LKAHP FFESVTWENLHQQT P P K 357 

II MlllilllMlllllillMII II: IMIIII III MM IM Ml II lltlllll 

Db 301 EYH F PE KFFP KARDLVEKLLVLDAT KRLGCE EMEG YG PLKAH PFFET I TWENLHQQTPP K 360 

Qy 358 LTA YLPAMSEDDEDCYGNYDNLLSQ FGCMQVS SS SSSHSLSASDTGLPQRSGSN I EQY I H 417 

lllllllllllllllllllllllllll lllllllllllll = 1 II IIMIIMI III 
Db 361 LTAYLPAMS EDDEDCYGNYDNLLSQ FG FMQVS SS SSSHSLSTVETS LPQRSGSN I EQY I H 420 

Qy 418 DLDSNSFELDLQFS EDEKRLLLBKQAGGNPWHQFVENNL I L KMG PVDKR KGLFARRRQLL 477 

lihiiiimiiimiiiiiiiiiiitmiiiiiiiiiiimiiiMiiiiiiiii 

Db 421 DLDTNS FB LDLQFSEDEKRLLLE KQAGGNPWH QFVENNL I L KMG PVDKR KGLFARRRQLL 480 

Qy 478 LTEGPHLYYVDPVNKVLKGE I PWSQELRPEAKNPKTFFVKTPNRTYYLMDPSGNAHKWCR 537 

lllllllllilllillllilllllllllilllllllllllllllllllllllllllllll 

Db 481 LTEGPHLYYVDPVNKVLKGE I PWSQELRPEAKNFKTFFVHTPNRTYYLMDPSGNAHKWCR 540 

Qy 538 KI QBVWRQRYQSH PDAAVQ 556 

Db 541 KIQEVWRQQYQSNPDAAVQ 559 



RESULT 7 
Q3UGN6_M0USB 

ID 03UGN6_MOUSE PRELIMINARY; PRT; 5S9 AA. 

AC Q3UGN6; 

DT 11-OCT-200S, integrated into UniProtKB/TrEMBL. 

DT ll-OCT-2005, sequence version 1. 

DT 21-PB8-2006, entry version 8. 

DB Melanocyte cDNA, RIKEN full-length enriched library, clone :G270062J02 

DB product: 3 -phoaphoinoei tide dependent protein kinase-1, full insert 

DB sequence . 

GN Name- Pdpkl; 

OS Mua musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrate; Buteleostomi; 

OC Mammalia; Eutheria; Sua rchontogl ires; Glires; Rodent ia; Sciurognathi; 

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI TaxID- 10090; 

RN (11 

RP NUCLEOTIDE SEQUENCE . 

RC STRAIN-CS7BL/6J; 

RX MEDLINE- 992792 53; PubMed- 10 34 963 6; DOI-10. 1016/50076-6879(99)03004-9; 

RA Caminei P.. Hayaehizaki Y. ; 

RT "High-ef f iciency full-length cDNA cloning.'; 

RL Methoda Enzymol. 303:19-44(1999). 

RN (2] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-CS7BL/6J; 

RX PubMed-16141072; DOI-10. 1126/science. 1112014; 

RA Caminei P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N. , 

RA Oyaraa R. . Ravasi T. , Lenhard B. . Wells C. . Kodzius R., Shimokawa K. , 

RA Bajic V.B., Brenner S.B., Batalov S., Forrest A.R., Zavolan M. , 

RA Davis M.J.. Wilming L.O., Aidinis V.. Allen J.B.. 



RA Ambesi-Impiombato A., Apweiler R. , Atureliya R.N., Bailey T.L., 

RA Banaal M. , Baxter L. , Beieel K.W., Boraano T. , Bono H. , Chalk A.M., 

RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., 

RA Crowe M.L., Dalle B. , Dalrymple B.P., de Bono 8., Delia Gatta G. , 

RA di Bernardo D. , Down T . , Engstrom P., Fagiolini M. , Faulkner G., 

RA Fletcher C.F., Pukushima T., Furuno M., Futaki S. , Gariboldi M., 

RA Georgii -Hemming P., Gingeraa T.R., Gojobori T. , Green R.B., 

RA Guatincich S-, Harbers M., Hayashi Y. , Henach T.K., Hirokawa N. , 

RA Hill D. i Huminiecki L. , lecono M., Ikeo K. , Iwama A., lahikawa T. , 

RA Jakt M. i Kanapin A., Katoh M. , Kawasawa Y. , Kelso J., Kitamura K. , 

RA Kitano H. , Kollias G. , Krishnan S.P.. Kruger A., Kummerfeld S.K., 

RA Kurochkin I.V., Lareeu L.P., Lazarevic D. , Lipovich L., Liu J., 

RA Liuni S., McWilliam S.. Madan Babu M. , Madera M. , Marchionni L., 

RA Hatsuda H., Matsuzawa S., Miki H., Mignone P., Miyake S., Morris K. , 

RA Mottagui-Tabar S. , Mulder N., Nakano N. , Nakauchi H . , Ng P. , 

RA Nilsson R. , Niehiguchi S. , Nishikawa S., Nori P., Ohara o. , 

RA Okazaki Y. , Orlando V., Pang K.C., Pavan W. J. , Paveei G. , Peaole G. , 

RA Petrovsky H. , Piazza S., Reed J., Reid J. P., Ring B.Z., Ringwald M. , 

RA Rost B. , Ruan Y. , Salzberg S.L., Sandelin A. , Schneider C. , 

RA Schonbach C. , Sekiguchi K. , Semple C.A. , Seno S., Sessa L. , Sheng Y. , 

RA Shibata Y., Shimada H. , Shimada K. , Silva D . , Sinclair B. , 

RA Sperling S., Stupka E . , Sugiura K. , Sultana R., Takenaka Y. , Taki K. , 

RA Tammoja K. , Tan S.L., Tang S., Taylor M.S., Tegner J. , Teichmann S.A. , 

RA Ueda H.R., van Nimwegen E . , Verardo R., Wei C.L., Yagi K. , 

RA Yamanishi H. ( Zabarovsky B. , Zhu S., Zimroer A., Hide W. , Bult C. , 

RA Grimmond S.M., Teaadale R.D., Liu E.T., Brusic V., Quackenbush J. , 

RA Wahleatedt C. , Mattick J.S., Hume D.A., Kai C. . Sasaki D., Tomaru Y. , 

RA Fukuda S. , Kanamori -Katayama M., Suzuki M. , Aoki J., Arakawa T., 

RA lida J., Imamura K. , Itoh M. , Kato T., Kawaji H., Kawagaahira N. , 

RA Kawashima T. , Kojima M. , Kondo S. , Konno H., Nakano K. , Ninomiya N. , 

RA Nishio T., Okada M. , Pleasy C, Shibata K. , Shiraki T. , Suzuki S. , 

RA Tagami M. , Waki K. , Watahiki A., Okamura-Oho Y-, Suzuki H. , Kawai J., 

RA Hayashizaki Y.; 

RT "The transcriptional landacape of the mammalian genome."; 

RL Science 309:1559-1563(2005) . 

RN (31 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-CS7BL/6J; 

RX PubMed-1 614 1073; DOI-10 . 1126/ecience. 1112009; 

RG RIKEN Genome Bxploretion Research Group, and Genome Science Group 

RG (Genome Network Core Team) and the FANTOM Conaortium; 

RT "Antieense Tranacription in the Mammalian Tranacriptome. • ; 

RL Science 309:1564-1566(2005) . 

RN 14] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN- CS7BL/6J; 

RX MEDLINE-223546B3; PubMed-1 24 668 51 ; DOI-10. 1038/nature01266; 

RA Okazaki Y. , Puruno M. , Kaaukawa T. , Adachi J., Bono H. , Kondo S. , 

RA Nikaido I., Oaato N. , Saito R. , Suzuki H., Yamanaka I., Kiyoeawa H, , 

RA Yagi K. , Tomaru Y. , Haaegawa Y., Nog ami A., Schonbach C. , Gojobori T-, 

RA Baldarelli R., Hill D.P., Bult C. , Hume D.A. , Quackenbush J., 

RA Schriral L.M., Kanapin A. , Matauda H. , Batalov S., Beiael K.W., 

RA Blake J. A., Bradt D. , Brusic V., Chothia C. , Corbani L.B., Cousins S. , 

RA Delia B. . Dragani T.A. , Fletcher C.F., Porrest A., Frazer K.S. , 

RA Gaaaterlend T. , Gariboldi M. , Giasi C. , Godzik A., Gough J. , 

RA Grimmond S., Guatincich S. , Hirokawa N. , Jackaon I.J., Jarvia B.D., 

RA Kanai A., Kawaji H. , Kawaaawa Y. , Kedzieraki R.M. , King B.L., 



RA Konagaya A. , Kurochkin I.V., Leo Y. . Lenhard B . , Lyons P. A. , 

RA Maglott D.R.. Maltaia L. , Marchionni L. , McKenzie L. , Miki H.. 

RA Nagashime T., Nuraata K. , Okido T. , Pavan W.J. , Pertea Q . , Fesole C, 

RA Petrovsky H. , Pillai R. , Pontius J.U., Qi D. , Raraachandren S., 

RA Revaai T., Reed J.C.. Reed D.J.. Reid J. , Ring B.2.. Ringwald M. , 

RA Sandelin A.. Schneider C. , Semple C.A. , Setou M. , Shimada K. . 

RA Sultana R . , Takenaka V., Taylor M.S., Teaadale R.D., Toraita M. , 

RA Verardo R . , Wagner L. , Wahlestedt C. , Wang Y . , Watanabe Y. , Walls C, 

RA Wilming L.O. , Wynahaw- Bori o A., Yanagisawa H., Yang I., Yang L. , 

RA Yuan Z. , Zavolan M.„ Zhu Y. , Zitmter A., Carninci P., Hayatsu N. , 

RA Hirozano-Kiahikawa T. , Konno H. . Nakamura M. , Sakazums N. < Sato K. , 

RA Shiraki T. , Waki K. , Kawai J-, Aizawa K. , Arakawa T.. Fukuda S., 

RA Hara A., Hashizuote W. ( imotani K. , Iahii Y. , Itoh H. a Kagawa I., 

RA Hiyazaki A., Sakai K. , Sasaki D., Shibata K. , Shinagawa A., 

RA Yasunishi A., Yoshino M. , Waterston R. , Lander B.S., Rogers J., 

RA Birney E. , Kayashizaki Y.j 

RT "Analysis of the mouse tranacriptome baaed on functional annotation of 

RT 60.770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002) . 

RN [5] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-CS7BL/6J; 

RX MEDLINE-21065660; PubMed-112178Sl; DOI-10 . 103B/35055500; 

RA Kawai J., Shinagawa A.. Shibata K. , Yoshino N., Itoh M. , Ishii Y. , 

RA Arakawa T. , Hara A.. Fukunishi Y., Konno H., Adachi J., Fukuda S., 

RA Aizawa K. . Izawa M. , Nishi K. , Kiyoaawa H . , Kondo S. . Yamanaka I., 

RA Saito T., Okazaki Y. , Gojobori T., Bono H . , Kasukawa T. , Saito R. , 

RA Kadota K. , Matsuda H.A., Ashburnor K. . Batalov S. . Caaavant T., 

RA Fleiaehmann W., Gaasterland T., Gissi C. . King B. . Kochiwa H . , 

RA Kuehl P., Lewis S. , Matsuo Y. , Nikaido I.. Pesole G. , Quackenbush J., 

RA Schriml L.M., Staubli P.. Suzuki R. , Tomita H. , Wagner L. , Washio T. , 

RA Sakai K. , Okido T. ( Furuno M . , Aono H., Baldarelli R.. Barsh G. , 

RA Blake J., Boffelli D . , Bojunga N. , Carninci P., de Bonaldo M.F., 

RA Brownstein M.J., Bult C. , Fletcher C. , Fujita M. , Gariboldi M. , 

RA Gustincich S. , Hill D. . Hofmann M. , Hume D.A., Kamiya M. , Lee N.H., 

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Morabeerta P., 

RA Nordone P., Ring B. , Ringwald M. , Rodriguez I., Sakamoto N. , 

RA Saaaki H. , Sato K. , Schoenbach C. , Seya T. , Shibata Y. , Storch K. -F. . 

RA Suzuki H., Toyo-oka K. . Wang K.H.. Weitz C. , Whittaker C. , Wilming L. , 

RA Wynshaw-Boris A., Yoshida K. , Hasegawa Y. , Kawaji H . , Kohtsuki S., 

RA Kayashizaki Y. ; 

RT -Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001) . 

RN (6] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-C57BL/6J; 

RX MEDLINE-20499374; PubMed- 11042 159; DOI-10 . 1101/gr . 145100 ; 

RA Carninci P., Shibata Y., Hayatsu N. . Sugahara Y. . Shibata K. , Itoh M. , 

RA Konno H., Okazaki Y., Muramatsu M. , Hayashizaki Y. ; 

RT "Normalization and subtraction of cap- trapper- selected cDNAs to 

RT prepare full-length cDNA libraries for rapid discovery of new genes."; 

RL Genome Res. 10:1617-1630(2000). 

RN [71 

RP NUCLEOTIDE SEQUENCE . 

RC STRAIN* CS 7 BL/ 6 J ; 

RX MEDLINE-20530913; PubMed-1 10 76861 ; DOI-10 . 1101/gr. 152600; 

RA Shibata K. , Itoh M. , Aizawa K. , Nagaoka S., Saaaki N. , Carninci P., 



RA Konno H., Akiyama J., Nishi K. , Kitsunai T. , Tashiro H. , Itoh M . , 

RA Sum! N. , Ishii Y., Nakamura S., Hazama M . , Nishine T., Harada A., 

RA Yamamoto R. , Matsumoto H.. Sakaguchi S., I keg ami T. , Kashiwagi K. , 

RA Fujlwake S., Inoue K. , Togawa Y., Izawa M. . Ohara B. . Watahiki M. , 

RA Yoneda Y. , Ishikawa T. , Ozawa K. , Tanaka T. . Matsuura S. , Kawai J. , 

RA Okazaki Y.. Muramatsu M. , Inoue Y. . Kira A.. Hayashizaki Y.j 

RT "RIKBN integrated sequence analysis (RISA) system- 3 84 -format 

RT sequencing pipeline with 384 multicapillary sequencer."; 

RL Genome Res. 10:1757-1771(2000). 

RN IB] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-C57BL/6J; 

RA Arakawa T., Carninci P., Fukuda S., Kashizume W. , Hayashida K. , 

RA Hori F , , Iida J., Imamura K. , Imotani K. . Itoh M. , Kanagawa S., 

RA Kawai J. , Kojima M. , Konno H., Murata M. , Nakamura H. , Ninomiya H. , 

RA Nishiyori H. , Nomura K. . Ohno M. , Sakazume N. , Sano H. , Sasaki D. , 

RA Shibata K. , Shiraki T., Tagami M. . Tagami Y. , Waki K. , Watahiki A., 

RA Muramatsu M. , Hayashizaki Y.j 

RL Submitted (MAR-2004) to the EMBL/ GenBank/DDBJ databases. 

CC -!- CATALYTIC ACTIVITY: ATP ♦ a protein - ADP ♦ a phosphoprotein. 

CC -!- SIMILARITY: Belongs to the Ser/Thr protein kinase family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons AtCribution-NoDerivs License 

CC 

DR EMBL; AK147835; BAE28171.1; -; mRNA. 

DR MGI; MGI:133B068; Pdpkl . 

DR GO; GO: 0005737; C: cytoplasm; IDA. 

DR GO; GO: 0016023; C : cytoplasmic membrane -bound vesicle; IDA. 

DR GO; GO:0004676; F:3-phosphoinositide-dependent protein kinase. . .; IDA. 

DR GO; GO: 0006972; P hyperosmotic response; IDA. 

DR GO; 00:0007165; P: signal transduction; TAS. 

DR Inter Pro; IPR000719; Protjcinase. 

DR InterPro; IPR008271; Ser_thr_pkin_AS . 

DR InterPro; IPR002290; Ser_thr_pkinase . 

DR InterPro; IPR001245; Tyr_pkinaae. 

DR Pfam; PF00069; Pkinase; 1. 

DR ProDom; PD000001; Prot kinase; 1. 

DR SMART; SM0O22O; S TKc;~l . 

DR PROSITE; PS00107; PROTBIN_KI NAS B_AT P ; 1. 

DR PROSITE; PS50011; PROTE I N_KI NAS E_DOM ; 1. 

DR PROSITE; PS00108; PROTE I N_KI NASB_ST ; 1. 

KW ATP-binding; Kinase; Nucleotide -binding; 

KW Serine/threonine-protein kinase; Transferase. 

SQ SEQUENCE 5S9 AA; 63687 MW; 74461D027460FAC6 CRC64; 

Query Match 93.8%; Score 2758.5; DB 2; Length 5S9; 

Best Local Similarity 94.3%; Pred. No. 3.3e-172; 

Matches 527; Conservative 9; Mismatches 20; Indels 3; Gaps 2; 
Qy 1 MAKTTSQLYDAVPIQSSWL^SCPSPSKVRTQTESSTPPGI POG-SRQGPAMDOTAAEPR 59 

iiniiiiiiiitniiiiiiiiiiiiiihiii « Mil I I I I I I It I II I 

Db 1 KARTTSQLYIlA.VPIQS5V\nXSCPSPSKVRS0TEPGSSPGIPSGVSRC^STMIXrrTAKAR 60 

Qy 60 PGAGSLQH - -AQPPPQPRKKRPEDFKFGKILGEGSFSTVVLARELATSREYAIfCI LEKRH 117 

I II II lllllllllllllllllllllllllllllllllllllllllllllll 
Db 61 PSTNPLC^HPAQLPPQPRKKRPEDFKFGKILGEGSFSTVVLARBLATSRBYAIKILEKRH 120 



oy 


118 


Db 


121 


0y 


176 


Db 


131 


Oy 


238 


Db 


241 


Oy 


298 


Db 


301 


oy 


358 


Db 


361 


Oy 


418 


Db 


421 


Oy 


478 


Db 


481 


Oy 


538 


Db 


541 



III I III IIIIIMIII1IIIMIIIIIII MM IIIMII IIIIIIIIIIIIIMMII 



llllllllllllill 



iiiiiiiiimmiiniiiimiuiiimiiiiii 



iiiiimnmiiiiiiiniiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimii 

TIKL 300 



ii iiiiiiiimiiimiiiiiiiiiiiiiiiiii iih:iiiiiiiiuii 



nilllMIIIIIIIII llllllllll IMIIIIIMIII ■ I illlMIII MM I 



418 DLDSNSFBLDLQFSEDE 



PWHQFVENNLILKMGPVDKRKGLFARRRQLL 477 



IM .IIMIIIHMIMMIMMIMIMMMMIIMMMMMMMM Mill 



M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 = 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M I M 1 1 1 



III 1 1 1 1 - 1 1 1 * 1 1 1 1 N 



RESULT 8 
0810Z4 MOUSE 

ID Q810Z4_KOUSE PRELIMINARY; PRT; 551 AA. 

AC Q810Z4; 

DT Ol-JUN-2003, integrated into UniProtKB/TrEMBL. 

DT 01-JUN-2003, sequence version 1. 

DT 07-FEB-2006, entry version IS. 

DB PDK1 (Fragment) . 

GN Name- Pdpkl ; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrate; Euteleoatorai; 

OC Kamnalia; Butherie; Euarchontoglires; Glires; Rodentia; Sciurognathi ; 

OC Kuroidea; Muridae; Murines; Mus. 

OX NCBI Taxi D- 100 90; 

RN (1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-129/SvJ; 

RA Brathwaite M., Waeltz P., Schlessinger D. , Nagaraja R.; 

RL Submitted (OCT- 2002) to the EMBL /GenBank/DDBJ databases. 

CC -!- SIMILARITY: Belongs to the Ser/Thr protein kinaae family. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terma 

CC Distributed under the Creative Commons Attribution -NoDeriva License 



|R EMBL; AY162410; AA017164.1; -; Genomic_DNA. 

IR KSSP; O15S30; 1H1W. 

|R SMR; QS10Z4; 67-353, 404-551. 

IR Bnsembl; ENSMUSG00000024122 ; Mus musculus. 

|R GO; 00:0005524; F : ATP binding; ISA. 

|R GO; 00:0000166; F: nucleotide binding; IEA. 

iR GO; GO: 0004674; F:protein serine/threonine kinase activity; IEJ 

•R GO; GO: 0016740; F: transferase ectivity; IEA. 

iR GO; GO: 0006466; P: protein amino acid phosphorylation; IEA. 

IR InterPro; IPR011993; PH_type. 

»R InterPro; IPR000719; Proteinase- 

IR InterPro; IPR008271; Ser_thr_pkin_AS. 

IR InterPro; I PRO 02 290; Ser~thrj>kinaee. 

IR InterPro; IPR001245; Tyr_pkinase. 

IR Pfam; PF0Q069; Pkinase; 1. 

IR ProDom; PD000001; Prot_kinase; 1. 

IR SMART; SM00220; S_TKc; 1. 

IR PROSITE; PS00107; PROTE IN_KI NASE_ATP ; 1. 

IR PROSITE; PS50011; PROTEIN KINASE DOM; 1. 

>R PROSITB; PS00108; PROTE I N_KI NASE_ST ; 1. 

W ATP-binding; Kinase; Nucleotide -binding; 

3* Serine/threonine-protein kinase; Transferase. 

T NON TER 1 1 

!Q SEW^ENCB 551 AA; 62870 MW; ACC31DS14392B2F4 CRC64 ; 

Query Match 92.9%; Score 2732.5; DB 2; Length 551; 

Best Local Similarity 94.6%; Pred. No. 1.6e-170; 

521; Conservative B; Mismatches 19; Indels 3; 



oy 


9 


Db 


1 


oy 


66 


Db 


61 


oy 


126 


Db 


121 


Qy 


186 


Db 


181 


Oy 


246 


Db 


241 


oy 


306 


Db 


301 


oy 


366 


Db 


361 



iiiiiiiiMiiiiiiiiMiMiii : 1 1 1 1 i mi mi ii n ii 

YDAVPIQSSVVlXXCPSPSMVRSQTEPGSSPGIPSGVSRCflS^ 60 
- - AQP P PQPRKKRPEDFKFG KI LGEGS FSTWLARB LATSRBYA I KI LBKRH 1 1 KENKVP 125 

II lllllllllllllllllllllllllllllllilllllllllllllllllllNII 



MUM III I IMMMI III I IIMIM 1 1 IMIIII I IMIMIIIII MM 1 1 II Ii 



IMM I Ml MMMII I MM MM MM MMIIII I MMMI I MMMI Mill I 



MMMMIMMMMMIMIIMMMMMMMMMMMMIIMM lllll 



MMMMIMIMIMMIIMMMMMMMMMMMMMIIIMMMIMI 



niiiiiiiiiniiiiii 1 1 1 1 1 1 1 1 1 1 1 i t 



MIMIMMIIIIIMMMI 



Oy 426 LDU)FSEDEKRLI^KQ\GGNPWHQFVENNLILK>C;WDKRKCLFARRRQLIJ J TEGPHLY 4 85 

421 ldlofsbdskrlllbk^^ 4 so 

Oy 466 YVDPVNXVLKGE I PWSQBlJIPEAKNPTCTPFVHTPNRTYYIJtDPSGNAHKWCRKI QEVWRQ 545 

481 iiUSK S40 

Qy 546 RYQSHPDAAVQ 556 

=111=111111 
Ob 541 QYQSNPDAAVQ 551 



PRELIMINARY; PRT; 533 AA. 



RESULT 9 
QSK3L3 MOUSE 
10 Q8K3L3 KOU5B 
AC Q8K3L37 

01-OCT-2003, integrated into UniProtKB/TrEKBL. 
01 -OCT- 2003, sequence veraion 1. 
31-FEB-2006, entry version 22. 

Phosphoinoei tide -dependent protein kinase- 1 bote. 
Nsme-Pdpkl; Synonyms- Pdklbete; 
Mus raueculus (House) . 

Eukaryota; Ketszoe; Chordata; Creniete; Vertebrate; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglirea; Glirea; Rodentia; Sciurognathi; 
Muroidea; Muridae; Murinae; Mus. 
NCBI TaxID-10090; 
[1J 

NUCLEOTIDE SEQUENCE . 

MEDLINB-22050196; PubMed-12054753 ; DOI-10 . 1016/S0006-291X (02 > 00449-7 ; 
Dong L-Q. • Ramoa F.J., Wick M.J., Lim M.A., Guo Z. , Strong R., 
Richardson A., Liu P.; 

"Cloning and characterization of a testis and brain-specific isoform 
of mouse 3 1 -phosphoinosi tide -dependent protein kineeo-1, raPDK-1 
beta . ■ ; 

Biochem. Biophys. Res. Commun. 294:136-144(2002). 
- !- CATALYTIC ACTIVITY: ATP ♦ a protein • ADP ♦ a phoaphoprotein. 
-!- SIMILARITY: Belongs to the Ser/Thr protein kinase family. 



DT 
DT 
DT 



RN 
RP 



RT 



CC 
CC 



Copyrighted by the UniProt Consortium, aee http://www.uniprot.org/terma 
Dietributed under the Creative Commons Attribution-NoDerive License 



EMBL; AY062008; AAL47185.1; -; raRNA. 

HSSP; 015530; 1H1W. 

SKR; Q8K3L3; 48-334, 385-532. 

Ensembl; ENSMUSG00000034133 ; Mus mueculus. 

HOI; KG I ; 1338068; Pdpkl . 

GO; 00:0005737; C:cytoplasm; IDA. 

GO; GO: 0016023; C:cytoplaamic membrane-bound vesicle; IDA. 
GO; GO: 0004676; F: 3 -phosphoinosi tide -dependent protein kinase. 
GO; GO: 0006972; P hyperosmotic response; IDA. 
GO; GO:0007165; P:signal tranaduction; TAS. 
InterPro; IPR011993; PH_type. 
InterPro; IPR000719; Prot_kinase. 
InterPro; IPR008271; Ser_thr _pkin_AS. 
InterPro; IPR002290; Ser_thr_pkinaee. 



DR InterPro; IPR00124 5; Tyr_pkinase. 

DR Pfam; PF00069; Pkinaae; 1. 

DR ProDom; PD000001; Prot kinase; 1. 

DR SMART; SM00220; S TRcj~l. 

DR PROSITB; PS00107 ;""pROTBIN KINASB_ATP; 1. 

OR PROSITB; PS50011; PROTEINJUNASE_DOM; l. 

DR PROSITB; PS00108; PROTB I N_K I NASB_ST ; 1. 

KW ATP -binding; Kinase; Nucleotide-binding; 

KW Serine/ threonine -protein kinaae; Transferase. 

SO SBQUBNCB 532 AA; 60934 KW; F9073 1C7ECDSE58 9 CRC64; 

Query Match 89.3%; Score 2626.5; 0B 2; Length 

Beat Local Similarity 94.2%; Pred. No. 1.4e-163; 
Matches 501; Conservative 9; Mismatches 19; Indels 
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RESULT 10 
Q5F3U4_CHICK 
ID Q5P3U4_CHICK 
AC QSP3U47 

DT 15-HAR-2005, integrated into UniProt KB/ TrEMBL. 



PRELIMINARY; PRT; 



556 AA. 



DT 15-MAR-2005, sequence veraion 1. 

DT 21-PEB-2006, entry version 10. 

DB Hypothetical protein. 

GN ORFNamee-RCJMB04_6pl0; 

OS Oallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata,- Vertebrate; Euteleostomi ; 

OC Archoaauria; Avea; Neognathae; Galliformea; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID-9031; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-CB; TISSUE-Bursa; 

RA Caldwell R.B., Kierzek A.M., Arakawa H., Bezzubov Y., Zaim J., 

RA Fiedler P., Kutter S., Blagodatski A., Kostovska 0., Koter M. , 

RA Plachy J., Carninci P., Hayaahizaki Y., Bueratedde J.M.; 

RT "Full-length cDNAs from chicken bursal lymphocytes to facilitate 

RT genef unction analysis."; 

RL Genome Biol. 6:R6-R6 (2005) . 

CC -I- CATALYTIC ACTIVITY: ATP ♦ a protein - ADP ♦ a phoaphoprotein. 

CC -I- SIMILARITY: Belongs to the Ser/Thr protein kinaae family. 

CC - 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terma 

CC Distributed under the Creative Commons Attribution -NoDerivs License 

CC - 

DR EMBL; AJ8S1SS6; CAK65190.1; -; raRNA. 

DR SMR; OSF3U4; 75-361, 412-S55. 

DR GO; 00:0005524; F : ATP binding; IEA. 

DR GO; GO: 0000166; {^nucleotide binding; ISA . 

OR GO; GO: 0004674; F:protein serine/threonine kinase activity; IEA. 

DR GO; GO: 0016740; F:tranaferaae activity; ISA. 

DR GO; 00:0006468; P:protein amino acid phosphorylation; IEA. 

DR InterPro; IPR011993; PH_type. 

DR InterPro; IPR000719; Prot_kinase. 

DR InterPro; IPR008271; Ser_thr_pkin_AS . 

DR InterPro; IPR002290; Ser_thr_pkinase. 

DR InterPro; IPR00124S; Tyr_pkinaae. 

DR Pfam; PF00069; Pkinase; 1. 

DR ProOom; PD000001; Proteinase; 1. 

DR SMART; SM00220; S_TKc;~l. 

DR PROSITB; PS00107; PROTB I N__KI NASB_ATP ; UNKNOWN_l . 

OR PROSITB; PS50011; PROTB I N~K I NAS E_DOM ; 1. 

DR PROSITB; PS0010B; PR0TEIN~KINASB~ST; 1. 

KW ATP-binding; Hypothetical'protein; Kinase; Nueleotide-binding; 

KW Serine/ threonine -protein kinase; Transferase. 

SO SEQUENCE 556 AA; 63068 MW; B1ED935D6C3B04C9 CRC64; 

Query Match 89.3%; Score 2626.5; DB 2; Length 556; 

Beat Local Similarity 90.5%; Pred. No. l.Se-163; 

Matchea 504; Conservative 15; Mismatches 33; Indels S; Gaps 4; 

Oy 1 RARTTSOLYIUV?IOSSVV1X^CPSPSMV11TOTESSTPPCIPG-GSRQGPAMDOTAAEPR 59 

II hi llllllllllllllllillllll I Mill I Mill h Mil I 

Db 1 MASTSCTLYDAVPIQSSVVIXSCPSPSMVRNOADSSTPSVISTCGSRO^SNMBSTAABSR 60 

60 PGAGSL-QH-AOPPPOPR^ 117 
Db 61 SSSNSLPQHTGOQPPQPRKKRPDDFKFGXII/IBGSFSTVVLARBLTSSRBYAIKILEKRH 120 



0y 118 I I KENKVP YVTRBROVMS RLDHP FFVKLYFTFQDDEKL YFGLSYAKNGBLL KYIRKIGSF 177 

IMIIMIIMIIMIIMIIIIIIIIIIIIIIMIIIIIMIMIIIIIMIMIMM 
Db 121 IIKENKVPYVTRERDVMSRU3HPFFVKI>YFTFQDDEKLYroLSYAKNGELLKYIRKIGSF 180 



Qy 178 DBTCTRFYTAEIVSALEYLHOKGI IKRDLKPENILLNEOMHIQITDFGTAKVLSPESKOA 237 

I III Ml MM II llll Ml I Ml III III I II I III Ml I III MIMI I Ml MMl 

Db 1B1 DETCTRFYTAB I VSALEYLHGKG I I KRD LKP EN I LLNBDMH I Q I TD FGTAKVLSADSRQA 240 

Oy 238 RANSFVGTAQYVSPBLLTEKSACKSSDLWALOCI IYQLVAGLPPFRAGNEYLI FQKI I KL 297 

241 RANSFVOTAOYVSPELLTE I YQLVAGLPPFRA(L!rEYLI ^ I KL 300 



Oy 298 EYDFPBKFFPKARDLVBIO^VUlATKRLGCEEMEOYGPLKAHPFFESVTWENUiQOTPPK 357 

IIMIIIIIIIhilMIIMIMMIMIMI MMiMMMM MM MMl 

Db 301 BYDFP E KF FP KAKD LVE KLL VLDATKRLGCE EMGGYG P LKAH P F FES I VWENLHLQTP PK 360 

Qy 358 LTAYLPAM5EDDEDCYGNYDNLLSQFGCMQVSSS S S SH SLSASDTGLPQRSGSN I EQY I H 417 

Ml I MIMI I MUM III I III IMMI II MIIMM M II II MIMM 

Db 361 LTAYLPAMSEDDEDCYGNYDNLLSQFtJCMQVSGSASSHSLSVPETSTPOrSOGNI EQY I H 420 

0y 418 DLDSNS FELDLQFS BOB KRLLLEKQAGGNPWHQFVENNL I LKMG PVDKRKGL FARRRQLL 477 

IIIMIIIIIIIMllllllll IIIIMIIIIIIIMIIIIIIMMMIMMIMM 
Db 421 OLDNNSFBLDLQFSEDE IG1LIJAKOAGGNPWHQFVBNNLI UOtGPVDKRKGLFARRRQLL 480 

Oy 478 LTEGPHLYYVDPVNKVLKGE I PWSQELRPEAKOTKTFFVHTPNRTYYLMDPSGNAHKWCR 537 

lllllllllllllllillllllll II Ml I IIMII llll III IM III I MIMI I I: 
Db 481 LTEGPHLYYVDPVNKVLKGE I PWSLELRPBAKNFKTFFVHTPNRTYYLMDPSGNJUIKWCK 540 



Oy 538 KI QBVWRQRYQSHPDAA 554 

MIMM || | :|| 
Db 541 KI QEVWRHR Y - - HQNAA SSS 



RESULT 11 
03TRL2_MOUSB 

10 Q3TRL2_MOUSE PRELIMINARY; PRT; S22 AA. 

AC 03TRL2? 

DT ll-OCT-2005. Integrated into Uni Prot KB/Tr EMBL . 

DT ll-OCT-2005, sequence version 1. 

DT 07-PEB-2006, entry veraion 7. 

DB Adult female vagina cDNA, RIKEN full-length enriched library, 

OS clone: 9930010N10 product:3-phoaphoinoaitide dependent protein kinaae- 

DB 1, full insert sequence. 

GN Name- Pdpkl; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrate; Euteleostomi; 

OC Mammalia; Eutheria; Buarchon tog 1 ires; Olires; Rodentia; Sciurognathi ; 

OC Muroidea; Muridae; Murinae; Mua. 

OX NCBI_TaxID-10090; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-C57BL/6J; TISSUB-Vagina ; 

RX MBOLINB-99279253; PubMed- 1034963 6; DOI-10. 1016/80076-6879(99) 03004-9; 

RA Carninci P., Hayaahizaki Y.; 

RT "High-efficiency full-length cDNA cloning."; 

RL Methods Bnzymol. 303:19-44(1999). 



RN 12) 

RP NUCLEOTIDE SSQUENCB. 

RC STRAIN-CS7BL/6J; TISSUE-Vagina; 

RX PubMod-16141072; DOI-10 . 1126/acience. 1112014 j 

RA Carninci P.. Kaaukawa T. , Katayama 8., Gough J. , Frith H.C., Maeda N. , 

RA Oyema R. , Ravaai T. , Lenhard B . , Walla C. , Kodziua R. , Shitaokawa K. , 

RA Bajic V.B.. Brenner SB., Batalov S., Porrest A.R.. Zavolan M. , 

RA Oavia M.J., Wilming L.G., Aidinie V., Allan J.B., 

RA Ambeei-Impiombato A. , Apweiler R. , Atureliya R.N., Bailey T.L., 

RA Banaal M. , Baxter L. , Beiaol K.W., Baraano T. , Bono H. , Chalk A.H., 

RA Chiu K.P., Choudhary V., Christoffele A.. Clutterbuck D.R., 

RA Crowe M.L. . Dal la B. , Dalrynple B.P.. da Bono B., Delia Oatta O., 

RA di Bernardo O. , Down T., Engstrora P., Fagiolini M. , Faulkner O. , 

RA Fletcher C.F., Fukuahima T. . Furuno M . , Futaki S., Oariboldi M. , 

RA Oeorgii -Heoraing P., Gingeraa T.R., Gojobori T., Green R.B., 

RA Guatincich S., Harbera M. , Hayaahi Y. » Kenach T.K., Hirokawa N . , 

RA Hill D., Huminiecki L.. lacono M. . Ikeo K. , Iwaraa A., Iahikawa T. , 

RA Jakt M. , Kanapin A.. Katoh M. , Kawaaawa Y. , Keleo J., Kiteraura H., 

RA KitenoH.. KolliaaG., Kriahnan S. p. , Kruger A. . Kttmmerfeld S.K.. 

RA Kurochkin I.V., Lareau L.P.. Lazarevic D. , Lipovich L. , Liu J. , 

RA Liuni S., McWilliara S., Ma dan Babu H. , Madera M. , Marchionni L. , 

RA Matauda H.. Metauzawa S., Hiki H., Mignone F., Miyake 8., Norria K. . 

RA Mottagui-Tabar S., Mulder N. , Nakano N. , Nakeuchi H . , Ng P., 

RA Nilaaon R . , Hiahiguchi S.. N iahikawa S.„ Mori P., Ohara O., 

RA Okazaki Y.. Orlando V.. Pang K.C., Pavan W.J., Paveai O., Peaole G . , 

RA Petrovaky N. , Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M. , 

RA Roet B., Ruan Y., Salzberg S.L., Sandelin A. , Schneider C, 

RA Schonbach C, Sekiguchi K. , Semple C.A. , Seno S., Seaaa L. , Sheng Y., 

RA Shibata Y. , Shitnada H. , Shiraada K. , Silva D. , Sinclair a., 

RA Sperling S., Stupka B . , Sugiura K. , Sultana R . , Takenaka Y. , Taki K. , 

RA Taraaoja K. , Tan S.L.. Tang S., Taylor M.S., Tegner J., Teichniann S.A., 

RA Uada H.R.. van Nimwegen B. , Verardo R . . Wei C.L.. Yagi K. , 

RA Yaraaniahi H. , Zabarovaky B. , Zhu S., Zimraer A. , Hide W. . Bult C. 

RA Grinrnond S.M., Teaadale R.D.. Liu E.T., Bruaic v., Quackenbuah J., 

RA Wahleatedt C. , Mattick J.S., Hume D.A., Kai C, Saaaki D., Totnaru Y. , 

RA Fukuda S. , Kanamori - Katayama M. , Suzuki M. , Aoki J., Arakawa T. , 

RA Iida J., Imamura K. , Itoh M. , Kato T., Kawaji H., Kawagaahira N. , 

RA Kawaehima T. , Kojiraa M-. Kendo S. , Konno H . , Nakano K. , Ninoraiya N. , 

RA Nishio T. , Okada M. , Pleeey C. , Shibata K. , Shiraki T.. Suzuki S., 

RA Tagami M. , Waki K. , Watahiki A., Okamura-Oho Y., Suzuki H. , Kawai J. , 

RA Hayaahizaki Y.j 

RT 'The transcriptional landscape of the mammalian genome. •; 

RL Science 309:1559-1563(2005) . 

RN (3) 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-CS7BL/6J; TISSUE-Vagina; 

RX PubMed-16141073; DOI-10 . 1136/science. 1113009; 

RG RI KEN Genome Exploration Research Group, and Genome Science Group 

RG (Genome Network Core Team) and the FANTOM Consortium; 

RT "Antiaenae Tranacription in the Marrmalian Tranacriptome . ■ ; 

RL Science 309:1564-1S66(2005) . 

RN [4] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-CS7BL/6J; TISSUE-Vagina; 

RX MEDLINE-223546S3; PubMed-12466851 ; DOI-10. 1038/nature01266; 

RA Okazaki Y. , Furuno M., Kaaukawa T., Adachi J., Bono H., Kondo S., 

RA Nikaido I., Oaato N. , Saito R. , Suzuki H., Yamanaka I., Kiyoaawa H . , 



RA Yagi K. , Toraaru Y.. Haeegawe Y. , Nogami A., Schonbach C. Gojobori T. . 

RA Baldarelll R. , Hill D.P.. Bult C. , Hume D.A., Quackenbuah J., 

RA Schriml L.M., Kanapin A.. Matauda H.. Batalov S., Beiaal K.W., 

RA Blake J. A. , Bradt D. , Bruaic V., Chothia C. , Corbani L.B.. Cousins s., 

RA Delia B. , Dragani T.A. , Fletcher C.F., Forrest A.. Frazer K.S.. 

RA Gaaaterland T., Oariboldi M. , Glaai C. , Godzik A., Gough J. , 

RA Griramond S . , Guatincich S., Hirokawa N. , Jackaon I.J., Jarvie B.D., 

RA Kanai A.. Kawaji H . , Kawaaawa Y. , Kedzieraki R.M., King B.L., 

RA Konagaya A.. Kurochkin I.V.. Lee Y., Lenhard B. , Lyons p. A. , 

RA Maglott D.R., Maltaia L.. Marchionni L. , McKenzle L. . Miki H. , 

RA Nagaahima T., Nuaata K. , Okido T. , Pavan W.J., Pertea 0., Peaole G., 

RA Petrovaky N. , Pillai R.. Pontiue J.U., Qi D. , Ramachandran 8., 

RA Ravaai T., Reed J.C., Reed D.J., Reid J. , Ring B.Z., Ringwald K. . 

RA Sandelin A.. Schneider C, Sample C.A., Setou N. , Shi ma da K. , 

RA Sultana R. . Takenaka Y. , Taylor M.S., Teaadale R.D., Tomita M., 

RA Verardo R . , Wagner L., Wahleatedt C. , Wang Y. , Watanabe Y., Walla C. , 

RA Wi lcning L.G., Wynehaw-Boria A., Yanagieawa M.. Yang I., Yang L., 

RA Yuen Z.. Zavolan M. , Zhu Y.. Ziniaer A., Caminci P., Hayatau N. , 

RA Hirozano-Kiahikawa T. . Konno H. , Nakanura M. . Sakazune N. . Sato K. , 

RA Shiraki T. , Waki K. , Kawai J. , Aizawa K. , Arakawa T., Fukuda 8., 

RA Hara A., Haahizume W., Imotani K. , Iahil Y. , Itoh M. , Kagawa I., 

RA Miyazaki A., Sakai K. , Saaaki D. , Shibata K. , Shinagawa A. , 

RA Yaauniahi A., Yoahino M., Wateraton R. , Lander B.S., Rogers J. , 

RA Birney B. . Hayaahizaki Y. ; 

RT "Analysis of the mouae tranacriptome baaed on functional annotation of 

RT 60,770 full-length cDNAe . " ; 

RL Nature 420 : S63-573 (2002) . 

RN (S] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-CS7BL/6J; TISSUE-Vagina; 

RX MEDLINE. 21085660; PubMed- 112 17851; DOI-10 . 103B/3S055SOO; 

RA Kawai J., Shinagawa A., Shibata K. , Yoahino M. , Itoh M. , Iahii Y. , 

RA Arakawa T. , Hara A., Fukuniahi Y. , Konno H. , Adachi J., Fukuda S., 

RA Aizawa K. , Izawa M. , Niahi K. , Kiyoaawa H. , Kondo S. , Yamanaka I.. 

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kaaukawa T., Saito R., 

RA Kadota K. , Matauda H.A. , As him me r M. , Batalov 6., Casavant T. , 

RA Fleiachmann W. , Gaaaterland T. , Giaal C. , King B., Kochiwa H. , 

RA Kuehl P., Lewie S., Matauo Y. , Nikaido I., Peaole G., Quackenbuah J., 

RA Schriml L.M., Staubli F., Suzuki R. , Tomita M. , Wagner L. , Waahio T., 

RA Sakai K. , Okido T. , Furuno M. , Aono H., Baldarelll R. , Barah G.. 

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.P., 

RA Brownatein M.J. , Bult C. , Fletcher C. . Fujita M., Gariboldi M., 

RA Guatincich S., Hill D., Hofmann M. , Hume D.A.. Kamiya M.. Lee N.H., 

RA Lyona P.. Marchionni L.. Maahima J. , Mazzarelli J., Mombaerta P., 

RA Nordone P., Ring B . , Ringwald M. , Rodriguez I . , Sakannto N. . 

RA Saaaki H., Sato K. . Schoenbach C. , Seya T., Shibata Y., Storch K. -F.. 

RA Suzuki H. , Toyo-oka K. , Wang K.H. . Weitz C. , whittaker C. Wilming L. . 

RA Wynahaw-Boria A., Yoahida K. , Haaegawa Y., Kawaji H. , Kohteuki S., 

RA Hayaahizaki Y. ; 

RT "Functional annotation of a full-length mouae cDNA collection.*; 

RL Nature 409:68S-690 (2001) . 

RN [6J 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN-C57BL/6J; TISSUE-Vagina; 

RX MEDLINE-20499374; PubMed-11042159; DOI-10 . 1101/gr. 145100; 

RA Carninci P., Shibata Y., Hayatau N. , Sugahara Y. , Shibata K. , Itoh M. , 

RA Konno H., Okazaki Y.. Muramatau M. , Hayaahizaki Y. ; 



■Normalization and subtraction of cap- trapper- selected cDNAs to 
prepare full-length cDNA librariee for rapid discovery of new genes."; 
Genome Res. 10:1617-1630(2000). 
[7) 

NUCLEOTIDE SEQUENCE. 
STRAIN-CS7BL/6J; TISSUE-Vagina; 

MEDLINE-20530913; PubMed-1 1076861 ; DOI-10 . 1101/gr . 152600 ; 
Shibata K. , Itoh M., Aizawa K. , Nagaoka S., Sasaki N. , Carninci P., 
Konno H. , Akiyama J., Niahi K. . Kitsunai T. , Taahiro H. , Itoh M. , 
Sumi N.. Ishii Y., Nakamura 8.. Hazama M. . Niahine T., Harada A., 
Yamaraoto R. , Mateumoto H., Sakaguchi S.. Ikegami T., Kashiwagi K. , 
Pujiwake S., Inoue K. , Togawa Y. , Izawa M. , Ohara B. , Watahiki M. , 
Yoneda Y. , Iahikawa T. , Ozawa K. , Tanaka T. , Matauura S. , Kawai J., 
Okazaki Y. , Muramatau M . , Inoue Y . , Kir a A., Hayaahizaki Y. ; 
"RIKEN integrated eequence analysis (RISA) system-384 -format 
sequencing pipeline with 384 nulticapillary sequencer."; 
Genome Res. 10:1757-1771(2000). 
(8] 

NUCLEOTIDE SEQUENCE. 
STRAIN-CS7BL/6J; TISSUE-Vagina; 

Arakawa T-, Carninci P., Fukuda S. , Haahizume W. , Hayaahida K. , 
Hori P., Iida J. , Imamura K. , Imotani K. , Itoh M. f Kanagawa S. , 
Kawai J., Kojima M. , Konno H., Murata M.. Nakamura M. , Ninomiya N. , 
Niahiyori H. , Nomura K. , Ohno M. , Sakazume N. , Sano H., Saaaki D., 
Shibata K. , Shiraki T., Tagami M. , Tagami Y., Waki K. . Watahiki A. , 
Muramatau M. , Hayaahizaki Y.; 

Submitted (APR- 2004) to the EMBL/ GenBank/ DDBJ databases. 

SIMILARITY: Belongs to the Ser/Thr protein kinase family. 

Copyrighted by the UniProt Conaortium, aee http://www.uniprot.org/terma 
Distributed under the Creative Conraona Attribution-NoOerive Licenae 



EMBL; AK162677; BAB37017.1; -; mRNA. 

MGI; MGI: 1338068; Pdpkl . 

GO; 00:0005737; C:cytoplaam; IDA. 

GO; GO: 00 16023; C: cytoplasmic membrane •bound vesicle; IDA. 

GO; GO:0004676; F : 3 -phoephoinoaitide-dependent protein kinase. 

GO; 00:0006972; P: hyperosmotic response; IDA. 

GO; GO:0007165; P:signal transduction; TAS. 

InterPro; IPR000719; Prot^kinaae. 

Inter Pro; IPR00B271; Ser_thr_pkin_AS. 

InterPro; IPR002290; Ser_thr_pkinase. 

InterPro; IPR001245; Tyrjkinase. 

Pfara; PF00069; Pkinaae; 1. 

Pro Dora; PD000001; Prot kinase; 1. 

SMART; SM00220; S_TKc;"l. 

PROSITB; PS00107;~PROTBINjaNASB_ATP; 1. 

PROSITB; PS50011; PROTE I N~ K 1 NASE_DOM ; 1. 

PROSITB; PS00108; PROTE I N~KI NASB_ST ; 1. 

ATP-binding; Kinase; Nucleotide -binding; 

Serine/ threonine -protein kinase; Transferase. 

522 AA; 59328 MW; 4C3 1B7D53F6306BF CRC64 ; 



Query Match 87.1%; 
Beat Local Similarity 94.6%; 
Matches 493; Conservative 



Score 2560.5; DB 2; 
Pred. No. 2.8e-159; 
i; Mismatches 19; 



Length 522; 



Oy 
Db 
Oy 
Db 
Oy 
Db 
Oy 
Db 
Oy 
Db 
Oy 

Db 

Qy 

Db 
Qy 
Db 
oy 

Db 



1 MARTTSQLYDAVPIQSSVVI<CSCPSPSMVRTQ/rESSTPPGIPGG-SRQGPAMDGTAAEPR 59 

llllilllllllllllllllllltlllllhlll : Mil i I I I I Mil li | 
1 KARTTSQLYDAVPIQSSVVLCSCPSPSMVRSC/rePCSSPGIPSGVSRM 60 

60 PGAGSLQH • -AQPPPQPRKKRPEDFKFGKILAEGSFSTVVLARELATSRBYAI KILEKRH 117 

I II II Ml I MMII I IIMIM III I III III MM I II I III I III MM 

61 PSTNPLQQH PAQLP PQPRKKRPED FKFGK I LGEGS FSTWLARE LATS REYA I KI LE KRH 120 
118 IIKENKVPYVTRERDVMSRLDHPFFVKLYFTFQDDBiaYFGI^AKNOELLKYIRKIGSP 177 

III MMIMIIMIIMIMMMIMMIMIMMIIMMMMMMMIIMM 

121 IIKSNKVpTVTRBRDVMSRi^HPPFVKLYFTFOBDSKLYrc ISO 
178 DETCTRFYTAEIVEALEYLHGWjI IHRDLKPENILLNEDMHIQITDFOTAKVLSPESKQA 237 

IMMIIMIMMIMM MIMIIMMIMMMMMIMIMMMMIMMM 

181 DSTCTRFYTABIVSALEYLHGKGI IHRDLKPENILLNEDMHIQITDFGTAKVLSPBSKQA 240 
238 RANSFVGTAQYVSPBLLTEKSACKSSDLWALGCI I YQLVAGLPPFRAGNBYLIFQKI IKL 297 
241 RANSFVOTAOYVSPELLTEKSACKSS^ I YQLVAGLPPFRAcJlEYL I FQKI I KL 300 

298 EYD FPE KFFPKARDLVB KLLVLDATKRLGCEEMEGYG PLKAH P FFESVTWENLHQQTPP K 357 

II Ml MM IMIIIIIIIIIIIMIIIIMIIIIIIMIMMMMIIIIMMMI 

301 EYH FPE KFP P IGXRDLVEKLLVLDATKRLGCEBMEGYG PLKAH P FFET I TWENIJ1QQTPP K 360 
358 LTAYLPAMSEDDEDCYC^NYDNLLSQFGCMC^SSSSSSHSl£ASDTGLPQRSGSNI EQY I H 417 

1 1 1 1 1 1 1 1 1 1 1 r i e 1 1 1 1 1 r 1 1 iiiiiiiiiiin :i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

361 LTAYLP AMSEDDEDCT GNYDNLLSQFGFMQVS SSSSSHSLSTVETSLPQRSGSN I EQY I H 420 

418 D LDSNS PBLD LQFSEDEKRLLLE KOAGGNPWHQFVENNL I LKMG PVDKRKQ LFARRRQLL 477 

421 DLDTOSFBLDLQPS^ 4fl 

478 LTEGPHLYYVDPVNKVLKGE I PWSQELRPEAKNFKTFFVHT 518 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 
481 LTEGPHLYYVDPVNKVLKGE I PWSQELRPEAKNFKTFFVHT 521 



PRELIMINARY; 



PRT; 558 AA. 



(Fragment) . 



RESULT 12 
O4S4R0 TETNG 

ID Q4S4R0JTBTNG 

AC Q4S4R0; 

DT 19-JUL-200S, integrated into UniProtKB/TrEMBL. 

DT 19- JUL- 2005, sequence version 1. 

DT 07-PEB-2006, entry version 5. 

DB Chromosome 2 SCAF14738. whole genome shotgun sequel 

GN ORFKames -GSTENG0 0024 075001; 

OS Tetraodon nigroviridis (Green puffer). 

OC Eukeryota; Metazoa; Chorda ta; Craniate; Vertebrate; Euteleoetomi; 

OC Actinopterygii; Neopterygii; Teleoatei; Euteleostei; Neoteleostei ; 

OC Acanthoraorpha; Acanthopterygii; Percomorpha; Tet raodonti formes; 

OC Tetradontoidea; Tet raodont idee; Tetraodon. 

OX NCBI TaxID-99883; 

RN [1J " 

RP NUCLEOTIDE SEQUENCE. 

RX PubMed-15496914; DOI-10 . 103a/nature03035; 

RA Jaillon O. , Aury J.>M., Brunet F. , Petit J.-L., Stange-Thoraann N. , 

RA Mauceli B. , Bouneau L., Pischer C. , Ozouf-Coataz C. , Bernot A., 



RA Nicaud S.. Jaffa D. , Fisher S., Lutfalla G. , Dos sat C. , Segurens B. , 

RA Daailva C. , Salanoubat N. , Levy M. < Boudat N. . Castellan© S., 

RA Anthouard V., Jubin C, Castelli V., Ketinka M. . Vacherla B. , 

RA Biemont C. , Skalli Z. , Cattolico L. , Pouiain J.. Da Barardinia V., 

RA Cruaud C. , Duprat S., Brottier P., Coutancoau J. -P., Gouzy J., 

RA Parra G. , Lardiar 0., Chappie C. , He Roman K.J. . McEwan P.. Boaak S.. 

RA Kellie M . , Volff J.-N., Ouigo R.. Zody M.C. , Heairov J.. 

RA Lindblad-Toh K. , Birren B . , Nusbaum C. , Kahn D. , Robinson -Rechavi M. , 

RA Laudet V., Schachter V., Quetier P., Saurin W. , Scarpelli C. , 

RA Wincker P.. Lander B.S., Weissenbach J., Roest Crolliua H.; 

RT 'Genome duplication in tha teleost fiah Tetraodon nigroviridia raveala 

RT tha early vertebrate proto-karyotype. " ; 

RL Nature 431:946-957(2004). 

RN (2| 

RP NUCLEOTIDE SEQUENCB. 

RG Genoacopo; Whitehead Inetituto Centre for Genome Research ; 

RL Submitted (FB8-2004) to the EMBL/GenBank/ DDBJ databaaea . 



CC -I- CAUTION: The aequence ahown here ia derived from an 

CC BKBL/GanBank/DDBJ whole genome shotgun (WGS) entry which ia 

CC preliminary data. 

CC - f - FUNCTION; Playa a key role in tha control of the eukaryotic cell 

CC cycle. It ia required in higher cella for entry into S-phaee and 

CC mitoaia. Component of the kinaae complex that phoaphorylatea the 

CC repetitive C-torrainus of RNA polymeraee II. Catalytic component of 

CC MPF (By eimilarity). 

CC -I- SUBUNIT: Forma a stable but non-covalent complex with eye 1 in B in 

CC mature oocytes (By similarity). 

CC -1- SIMILARITY: Belongs to tha Ser/Thr protein kinaae family. 



CC 

CC Copyrighted by the UniProt Conaortium. see http://www.uniprot.org/terms 

CC Distributed under the Creative Coranons Attribution-NoDarivs Licenae 

cc 

OR EMBL; CAAE01014738; CAG04372.1; •; Genomi c_DNA . 

DR SMR; O4S4R0; 78-364, 417-557. 

DR GO; GO: 0005524; P:ATP binding; ISA. 

OR GO; GO: 0000166; F: nucleotide binding; IEA. 

DR GO; GO: 0004 674; P: protein serine/ threonine kinase activity; ISA. 

DR GO; GO: 0016740; F : transferase activity; IEA. 

DR GO; GO: 0006468; P:protein amino acid phosphorylation; IEA. 

DR InterPrO; IPR000719; Protjcinaaa. 

DR InterPro; IPR008271; Ser_thr_pkin_AS . 

DR InterPro; IPR002290; Ser_thr_pkinaee. 

OR InterPro; IPR00124S; Tyrj>kinaee. 

DR Pfam; PF00069; Pkinaae; 1. 

□R ProDom; PD000001; Protjcinase; 1. 

OR SMART; SM00220; S_TKc;~l. 

DR PROSITE; PS00107; PR0TE I N_K I NAS B_AT P ; 1. 

DR PROSITE; PS50011; PROTBIN_KINASE_DOM; 1. 

DR PROSITE; PS00108; PROTB I N__K I NAS B_ST ; 1. 

KW ATP-binding; Kinase; Nucleotide -binding; 

KW Serine/ threonine -protein kinase; Transferaae. 

FT NON TBR 558 558 

SQ SEQUENCE 558 AA; 63806 KW; 4E0997DA7344C73A CRC64; 

Query Match Bl.3%; Score 2391.5; DB 2; Length 558; 

Best Local Similarity 81.7%; Pred. No. 3.7e-148; 

Matches 455; Conservative 43; Mismatches 52; Indels 7; Gape 4; 



Oy 


1 


NARTTSQLYDAVPIQSSVVLCSCPSPS><VtfraTBSSTPPGIPa^ 59 




III llllll INI 111 = 111 Mill 1 ! 1 =1 'II 1 h|:| ::| I 




Db 


1 




60 


Oy 


60 


PGAGSLOHAQP P - PQPRKKR PSD FKFG KI LGEGS F STWLARB LAT SRBYA I KI LE 


114 




h i 1 hit llhlhlllllllllllllllllllllil II MINIMI 




Db 


61 


LDQASQSQTQQAARPPQFQPKKKXFSDPKFGKI LGEGSFSTWIAREQATGXEYAI KILE 


130 


0y 


115 


KRH 1 1 KENKVPYVTRERDVKS RLDH PFFVKL Y PTFO£DB10, YFGLSYAJQtJSLLKY I RKI 


174 




iiiiiiiiii ii nihil hi mi iii -iiii nun i in mi i in in mi 




Db 


121 


KRH 1 1 KENKVQYVKRERDLMS I LEH PFFVKL FFTFQDDBKLY FGLSYWQJGBLLKY I RKI 


180 


0y 


175 


GSFDBTCTRFYTAEIVSALBYLHGIQSIIHRDLKFBNILLNBDMHIQITDFOT^ 


234 






llllllllllhllll llllll llllllllllllllhllllllllllllll II -\ 




Db 


181 


GSFDBTCTRPYSABIVCALBYLHSKGI IHRDLKPENILLSEDMHIQITDPGTAKQLSSDS 


240 


Oy 


235 


KQARANS FVGTAQYVSP ELLTEKSACKSSDLWALGC 1 1 YQLVAGLP P FRAGNEYL I FQKI 


2 94 






Mlllll 1 MM lllllllll 1 III III 1 III MM lllllll III III 1 lllllll III 




Db 


241 


KQARANSFVOTAQYVS P B LLTE KSACKSSDLWALGC 1 1 YQLVAGL P P FRAGNEYL I FQKI 


300 


Oy 


295 


I KLBYDFPEKFFPKARD LVE KLLVLI3ATKRLGCEEMEGYG PLKAH P PFBSVTWENLHQQT 


354 




llllhlliillllhlil Ml : MIMIIII II III lll|:-:h:|l II 




Db 


301 


I KLBYEFPEKFFPKAKD LVTQLLSQE PS KR I GCEEMGGYE PLKQH P PFDT I SWDDLKLQT 


360 


Oy 


355 


PPKLTAYLPAMSEDDEDCYC^NYDNLLSQFC'CMQVSSSSSSHSLSASD -TGLPQRSGSNI B 


413 






Mill llllll MMIM IIIIMI Ml |||: llllllll : I llll III! 




Db 


361 


PP KLT PYLPAMSEDDEDCYGNYDD LLSQFSNMQVAQSSSSHS LS PHBSTP P PQRSS SN I B 


420 


Oy 


414 


OY I HDLDSNS FELDLQFSEDE KRLLLE KQAGGNPWHQFVENNL I LKMGPVD KRKDLFARR 473 






iiiiiihiniiiiinhihiihii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


QY I HDLDNNS FELDLQFTEEE KQLLLD KQTTGNPWHQFVENNL I LKMGPVD KRKG LFARR 480 


Oy 


474 


RQLIXTEGPHLYYVDPVNKVXKGBIPWSQBLRPEAIONFKTFFVHTPOT 


533 






llllllllllllllllllllllllllll llllllllllllllllllllllllllllli 




Db 


481 


RQLLLTEGPH L YYVD PVNKVLKGE I PWS PELRPEAKNFKTFFVKT PNRTYYLMDPSGNAD 


540 


Oy 


534 


KWCRKIOEVWRQRYQSH 550 






HIMIMIIh II I 




Db 


541 


RWCKKIQBVWRRPYQRH 557 





RESULT 13 
Q59EH6_HUMAN 

ID QS 9 EH 6_HUMAN PRELIMINARY; PRT; 492 AA. 

AC Q59EH6; 

DT 26-APR-200S, integrated into Uni Pro t KB/TrEKBL . 

DT 26-AFR-2005, aequence veraion 1. 

DT 07-FBB-2006, entry veraion 7. 

DE 3-phoaphoinoaitida dependent protein kinsso-1 variant (Fragment). 

OS Homo aapiens (Human) . 

OC Bukaryota; Metazoa; Chordata; Craniate; Vertebrate; Buteleoatomi; 

OC Mammalia; Butheria; Buarchontoglires; Primates; Catarrhini; Hominidae; 

OC Homo. 

OX NCBI_TaxID-9606; 

RN (1] 

RP NUCLEOTIDE SEQUENCE. 



RC TISSUE- Brain; 

RA Totoki Y., Toyoda A., Takeda T. , Sakaki Y. , Tanaka A., Yokoyama S. 

RA Ohara O. , Nagaaa T. , Kikuno F.R.; 

RT "None Title.'; 

RL Submitted (MAR- 200 5) to the EMB L / GenBank /DDBJ databases. 

CC -1- SIMILARITY : Belongs to the Ser/Thr protein kinaae family. 

CC 



CC 
CC 



Copyrighted by the UniProt Consortium, see http://www.uniprot.org/te; 
Distributed under the Creative Commons Attribution-NoDerivs License 



>R EMBL; AB209B3S; BAD93072.1; -; mRNA. 

>R SMR; QS9EH6; 110-396. 

)R Snsembl; ENSG00000140992 ; Homo sapiens. 

>R GO; GO:000SS24; F:ATP binding; ISA. 

IR GO; GO:0000166; F;nucleotide binding; IEA. 

IR GO; GO: 0004674; F:protein serine/threonine kinase ectivity; IEA. 

)R GO; GO: 0016740; F: transferaae activity; ISA. 

)R GO; GO: 0006468; P: protein amino acid phosphorylation; ISA. 

)R InterPro; IPR00D169; Pept_cys_AS. 

»R InterPro; IPR000719; Protjcinaae. 

IR InterPro; IPR008271; Ser_thr_pkin_AS . 

IR InterPro; IPR002290; Ser_thr_pkinaae . 

IR InterPro; I PRO 0124 5; Tyrj>kinaae. 

IR Pfam; PF00069; Pkinase; 1. 

IR SMART; SM00220; S_TKc; 1. 

IR PROSITE; PS00107; PROTEINJ<INASB_ATP; UNKNOWN_l . 

IR PROSITE; PS50011; PROTEINJCINASEJJOM; 1. 

!R PROSITB; PS00108; PROTEIN_KINASB_ST; 1. 

IR PROSITB; PS00139; THIOL_PR0TBASE~CYS; UNKNOWN _1 . 

*t ATP-binding; Kinase; Nucleotide -binding; 

W Serine/threonine-protein kinase; Transferase. 

T NON TER 1 1 

IQ SEQUENCE 492 AA; 54828 KW; A8BA2A6F7C986B26 CRC64; 

Query Match 79.4%; Score 2336; DB 2; Length 492; 

Best Local Similarity 100.0%; Pred. NO. 1.4e-144; 

Matches 447; Conaervative 0; Mismatches 0; Indels 0; Gaps 0; 

ly 1 MARTTSQLYDAVP I QSSWLCSCPSPSMVRTQTBSSTPPG I PGGSRQGPAMDGTAAEPRP 60 

llllllllllllllllllllillllllllinillllllllllinillllillllllll 
lb 39 MARTTSQLYDAVPIO^SWLCSCPSPSMVRTOTBSSTPPOIPOGSROGPAMI^ 98 

fy 61 OAGSLQHAQPPPQPRKKRPEDFKFGKI LGEGSPSTWLARBLATSREYAI KI LB KRH I IK 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
>b 99 GAGSLQHAQPPPQPRKKRPEDFKFGKI LGEGSFSTWLARELATSRSYAI KI LEKRH 1 1 K 158 



Oy 121 ENKVFYVTRSRDVMSRLDH PFFVKL YPTFQJ>DBKLYFGI*CTAXNGBLLCT I RKIGSFDET 180 

iniiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 1S9 ENXVPYVTRERDVMSPJ^HPFFVKLYFTFQDDEKLYFGLSYAKNGBLLKYIRKIGSro 218 

Oy 181 CTRFYTABIVSALEYLHGKGI IHRDLICPENILLNEOMHIQITDFGTAKVLSPSSKOARAN 240 

II MIMIMMMIIIMMIMIIIMM llllll Ml llllllll III III llll M 

Db 219 CTRFYTABIVSALEYLHGKGI IHRDLKPBNILLNEBMHIOITDFGTAKVLSPESKOARAN 278 

Oy 241 S PVOTAQYVSPBLI>TE KSACKSSDLWALGC 1 1 YQLVAGLP P FRAGNEYL I PQKI I KLEYD 300 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiiiiiiiiiiiiiiiin 

Db 279 SFVGTAQYVSPELLTE KSACKSSDLWALGC I IYQLVAGLPPFRAGNBYLI FQKI I KLEYD 338 



Oy 

Db 



301 FPB KF PP KARDLVE KLL VLDATKRLGCEEMEGYGPL KAH P FFESVTWENLH QQTPP KLT A 360 

lllllllllllllllllllllllllllllllllllllllllllllllllinilllllll 
339 FPB KF PP KARDLVE KLLVLDAT KR LG CE EHEG YG P L KAH P PFBSVTWENLHQQTPP KLTA 398 

361 YLPAMSEDD BDCYGNYDNLLSQFGCMQVSSS SSSH S LSASDTG LPQRSG SN I EQY I HD LD 420 

IMIIMMMIMMMMMMIMIMIMMIMIMIMMMMMIIMIMI 

399 YLPAMSEDDEDCYGNYDNLLSQFGOIQVSSSSSSHSLSASDTGLPQRSGSNIBOTIHDLD 458 
4» T^^f^^^^^> 485 



RESULT 14 
Q6NZV1_BRARB 

ID Q6 NZV 1 _ BRARB PRELIMINARY; PRT; 537 AA. 

AC Q6NZV1 ; 

DT OS -JUL- 2004, integreted into UniProtKB/TrEMBL. 

DT 05 -JUL- 2004, sequence version 1. 

DT 21-FBB-2006, entry version 20. 

DE Hypothetical protein zgc: 77318. 

GN ORFNames-zgc: 77318; 

OS Brachydanio rerio (Zebraf ish) (Danio rerio) . 

OC Eukaryota; Ketazoa; Chordata; Craniate; Vertebrate; Buteleoatomi; 

OC Actinopterygii; Neopterygii; Teleoetei; Ostariophyai; Cyprini formes; 

OC Cyprinidae; Danio. 

OX NCBI_TaxID-7955; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE- Embryo; 

RX MEDLINB-223 88257; PubMed- 12477 932; DOI-10 . 1073/pnas - 242603899; 

RA Strausberg R.L., Feingold B.A. , Grouse L.H., Derge J.O., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer CP., Bhat N.K. , 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Hang J., Hsieh P., 

RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.P., Casavant T.L., Scheetz T.B., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Caminci P.. Prange C, 

RA Raha 6.S., Loquelleno N.A. , Peters O.J. . Abramson R.D., Mullahy S. J. , 

RA Bosak S.A.. He Swan P. J. , McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S. , Worley K.C , Hale S., Garcie A.M., Gay L.J., Hulyk S.W. , 

RA Villslon D.K., Muzny D.M., Sodergren S.J., Lu X., Gibbs R.A.. 

RA Fahey J., Helton B. , Ketteman M. , Madan A., Rodriguea S,, Sanchez A., 

RA Whiting M., Madan A., Young A.C, Shevchenko Y. , Bouffard G.G., 

RA Blekeeley R.W., Touchman J.W., Green B.D., Dickaon M.C. 

RA Rodriguez A.C, Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywlnski H.I., Skalska U., Smailua D.B., 

RA Schnerch A., Schein J.B., Jones S.J.M., Marra H.A.; 

RT •Generation and initial analysis of more than 15,000 full-length human 

RT end mouse cDNA sequences.'; 



Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 
(21 

NUCLEOTIDE SEQUENCE. 



NIH MOC Project; 

Submitted (FSB -20 04) to the EMBL/GenBank /DDBJ databaaea. 



-I- CATALYTIC ACTIVITY: ATP ♦ a protein ■ ADP ♦ a phoephoprotoin. 
-!- SIMILARITY: Belongs to tha Ser/Thr protein kinase family. 

Copyrighted by tha UniProt Consortium, te« http://www.uniprot.org/teri 
Diatributad under the Creative Gommons Attribution-NoDerivs License 

EKBL; BC065959; AAH65959.1; -; mRNA. 

HSSP; Q63450; 1A06. 

SMRj Q6NZV1; 51-336, 397-532. 

Bneembl; BNSOARG00000018285; Oanio rerio. 

ZPIN; ZDB-GENE-040426-1820; ZgC:77318. 

GO; GO:000552«; V : ATP binding; ISA. 

GO; 00:0000166; P:nucleotide binding; IBA. 

GO; CO:0004674 ; P:protein aerine/threonine kinase activity; IEA. 

GO; GO: 00 16740; P: craneferaae activity; IEA. 

GO; GO: 00064 68; P.-protein amino acid phoaphorylation; IEA. 

InterPro; IPR011993; PH_type. 

InterPro; IPR000719; Proteinase. 

InterPro; IPR008271; Ser_thr _pkin_AS. 

InterPro; IPR002390; Ser_thr _pkinise. 

InterPro; IPRD01245; Tyrj>kinase. 

Pfam; PFOQ069; Pkinaae; 1. 

ProDora; PD000001; Prot_kinaae; 1. 

SMART; SM00220; S_TKc;~l. 

PROSITB; PS00107; PROTB I N_KI NASB_ATP ; 1. 

PROSITB; PSS0011; PROTB IN~KINASB*"dOM; 1. 

PROSITB; PS00108; PROTB I N~KI NASB~ST ; 1. 

ATP-binding; Hypothetical protein; Kinase; Nucleotide -binding; 
Serine/ threonine -protein kinaee; Transferase. 

537 AA; 61490 MW; 2OA0FED4B0A79D6F CRC64; 



Query Match 75.8%; 
Beat Local Similarity 77.6%; 
Matches 430; Conaervative 4 



Score 2228; DB 2; Length 537; 
Pred. No. 1.9e-137; 
1; Mismatches 54; Indels 26; 



Gaps 



1 MARTTSQLYDAVP I QSSWLCSCPS PSMVRTQTESSTP PG I PGGSRQG PAXDGTAAE PRP 60 

ill llhll i II II II Mill I - I : I I :||| I 
1 MARATSQI YDVASLHSS -VLISC- -PSMVRGQQDT PRLHGNSMEGPA P 45 

61 GAG S LQHAQP P PQ PRKKRPED PKFC K I LGEGS FSTWLARELATS REYA I KI LB KRH 1 1 K 120 

I I lllllllllhllllllllllillllhl I llllllll lllhl 

46 NA QPAQPRKKRPEDFRFGKI LGBGSFSTWLAXBHTTGKEYAI KI LEKRHIMK 98 

121 ENKVPYVTRE RDVMSRLDH PP PVKL YFTFQDD B KL YFGLSYAKNG ELLKYIRKIGSFDET 180 
III II lllhll llllllllllllllhllllllllllllllllllllllllllll 
99 ENKAQYVKRERD I MSHLDH PPFVKLYPTFQDEE KL Y FG LS YAKNG E LLKY IRKIGSFDBT 158 

181 CTRPYTAEIVSALBYLHGKOI IHRDLKPENILLNEDMHIQITDFGTAKVLSPSSKQARAN 240 

llllhllll llllll IMIIIIIMIII II MMMIIIIMII II Mllllll 
159 CTRFYSAEIVCALEYLHQKGI IHRDLKPENILLSEEMHIQITDFGTAKQLSSDSKQARAN 218 



241 SPVGTAQYVSPELLTEKSACKSSDLWALGCI IYOLVAGLPPFRAGNEYLIFQKI I KLEYD 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllh 
219 S FVGTAQYVS PELLTE KSACKS SD LMALGC 1 1 YQLVAGLP PFRAGNEYL I FQKI I KLEYB 278 

301 FPEKFFPKARDLVEKLLVLDATKRIXSCEEMEGYGPLKAHPFFESVTWENI^ 360 
llllllllhlllhll Ih Ihlhll lllllhlllllll h:|l MINIM 



Db 


379 


Qy 


361 


Ob 


339 


0y 


417 


Db 


399 


Qy 


477 


Ob 


459 


Qy 


537 


Db 


S19 



379 FPEKFFPKAKDLVEQLLSliSCKRIGCDEMGGYGPLKSHPPFESVGTODUi^ 338 



llllllllllllllllhlllll III 



'I I i I 



1 = 11111 11111 = 1 MM II Ih 1 1 1 1 1 J 1 1 1 1 1 1 1 M 1 1 II M II 1 1 1 M I U 



lllllllllllllllllllllllll IMIIIIIIIIIIIIIIIIIIIIIIIIIII III 



=1111111= I I 



RESULT 15 
Q9BROS HUMAN 

ID Q9BRDS_HUMAN PRELIMINARY; PRT; 429 AA. 
AC Q9 BROS; Q8IVS2; 

DT 01-JUN-2001. integrated into Uni ProtKB/TrEMBL . 
01 -MAR -2004, sequence version 2. 
07-PEB-2006, entry version 29. 

3-phosphoinositide dependent protein kinase-1 (PDPK1 protein). 
Name-PDPKl; 
Homo aapiens (Human) . 

Eukaryota; Metazoa; Chordata; Craniate; Vertebrate; Euteleostorai; 
Mammalia; Butheria; Buarchontoglires; Primates; Cetarrhini; Hominidae; 



DT 



OC 
OC 



OC Homo. 

OX NCBI TaxID-9606; 

RN (1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUB-Brain, and Uterua; 

RX MEDLINB-22388257; PubMed-1 2477932; DOI-10 . 1073/pnaa . 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L.H. , Derge J.G., 

RA Klauaner R.D.. Colline P.S., Wagner L., Shenmen CM., Schuler G.D. , 

RA Altachul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K. , 

RA Hopkina R.P., Jordan H., Moore T.. Max S.I.. Wang J., Heieh P., 

RA Diatchenko L. . Marusina K. , Parmer A. A., Rubin G.M., Hong L. , 

RA Stapleton N. , Soares M.B., Bonaldo M.P., Caaavant T.L.. Scheetz T.S., 

RA Brownatein M.J., Uadin T.B., Toahiyuki S., Carninci P., Prange C, 

RA Rsha S.S., Loquellano N.A., Peters O.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J. , Malek J. A., Gunaratne P.H., 

RA Richards S. , Worley K.C., Hale S.. Garcia A.M.. Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren B.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G. , 

RA Blakeeley R.W., Touchman J.W., Green B.D., Dickson M.C., 

RA Rodriguez A.C., Orimwood J., Schnutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywineki H.I., Skalska U. , Smallus D.B., 

RA Schnerch A., Schein J.B., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 Cull -length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 



CC 
CC 



[2] 

NUCLEOTIDE SEQUENCE. 
TISSUE-Uterua; 
Director MGC Project; 

Submitted (APR- 2001) to the EMBL/GenBank/DDBJ databases. 
[3] 

NUCLEOTIDE SEQUENCE. 

TISSUB-Brain; 

NIH MGC Project; 

Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases. 

Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 
Distributed under the Creative Commons Attribution-NoDeriva License 



DR LinkHub; 



BMBL; BC006339; AAH06339.2; -; mRNA. 
EMBL; BC033494; AAH33494.1; -; mRNA. 
HSSP; 015530; 1H1W. 
SMR; Q9BRD5; 101-231, 282-429. 
Bnsembl; ENSG00000140992 ; Homo sapiens. 

Q9BRD5; -. 
GO; GO:0005S24; P :ATP binding; IEA. 
GO; GO:0000166; F:nucleotide binding; IBA. 
GO; GO:0004672; P:protein kinase activity; IEA. 
GO; 00:0016740; P: tranaf eraae activity; IBA. 
GO; 00:0006468; P: protein amino acid phosphorylation; IEA. 
InterPro; IPR011993; PH_type. 
InterPro; IPR000719; Prot_kinaae. 
InterPro; IPR002290; Ser_thr_pkinaae. 
Pfam; PF00069; Pkinaae; 2. 
ProDom; PD000001; Prot_kinase; 1. 
PROSITB; PS50011; P ROTE I N_KI NASS_DOM ; 1. 
ATP-binding; Kinase; Nucleotide -binding; Transferase. 
SEQUENCE 429 AA; 48201 MW; 860C8A8C06161CS1 CRC64; 



Query Match 74.9%; 
Beat Local Similarity 77.0%; 
Matches 428; Conservative 



Score 2202.5; DB 2; 
Pred. No. 6.5e-136; 
0; Mismatches 1; 



Indels 127; Gaps 



Oy 


l 


Db 


l 


0y 


61 


Db 


61 


Oy 


121 


Db 


110 


Oy 


181 


Db 


110 


Oy 


341 


Db 


114 



lllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



iiimimiiiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiii 



in 

-TRAN 113 
: I KLEYD 300 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

! I KLEYD 173 



Oy 
Db 
Oy 
Db 
0y 

Db 

Oy 
Db 
Oy 
Db 



301 FPBKFFPKARDLVEKI^VLDATKRLGCEEMEGYGPLKAHPFFESVTWENLHQQTPPKLTA 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
174 F PE KFFP KARDLVBKLLVLOAT KRLGCBEMEGYGPLKAH P FPESVTWBNLH QOTP P KLTA 233 

361 YLPAMSEDDS0CYGNYDNLLS0FGCM0VS SSS S SHS LSASDTGLPQRSGSN I EQY I HDLD 420 

III Ml MM II I IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 

234 YLPAMSEDDEDCYGNYDNLLSQ FGCK QVS S SS S SHSLSASDTGLPQRSGSN I EQY I HDLD 293 

421 SNS FELDLQFSEDEKRLLLE KQAGGNFWHQFVENNL I LKMG PVDKR KGLFARRRQLLLTB 480 

294 tJjisJilwsmU' 1 "^ 353 

481 GPHLYYVDPVNICVLKGE I PWSQELRPEAiGff CTFFVHTPNRTYYLMDPSGNAHKWCRKI 0 540 

lllllllllllllllllllllillllllllllllllllllllllllllllllllilllll 
354 GPHLYYVDPvmVLKCBIPWSQBLRPEAKNFKTPFVTn-PNRTYYLMDPSGNAHKWCT^ 413 

541 EVWRQRYQSH PDAAVQ 556 

IIIIIIIIIIIIIMI 
414 EVWRQRYQSH PDAAVQ 429 



Search completed: June 27, 2006, 01:36:19 
Job time : 302 sees 



GonCoro version 5.1.9 
Copyright (c) 1993 - 2006 Bioccoleration Ltd. 



OM protein - protein search, using &•* model 



Title: 

Perfect score: 
Sequence : 

Scoring table 
Searched: 



June 26, 2006, 23:30:12 ; Search time 196 Seconds 
(without alignments) 
1297.002 Million cell updates/sec 

US-10-689-S76-1 
2941 

1 MARTTSQLYDAVP I QSSWL RKI QEVWRQR YQSH PQAAVQ 556 



BLOSVJK62 

i 10.0 



Gapext 0.S 

2589679 seqs, 457216429 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum OB seq length: 2000000000 



tssing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 sum 



geneseqpl980s:* 
geneseqpl990s : • 
geneseqp2000s : * 
geneseqp2001s : * 
geneseqp2002s : • 
geneseqp2003as : 
geneseqp2003bs : 
geneseqp2004s : • 
geneseqp200Ss: * 



Pred. No. is the number of results predicted by chance to have a 
•core greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Result 

No. 



Query 

Score Match Length DB 



ID 



Description 



2941 100.0 

2941 100.0 

2941 100.0 

2941 100. 0 

2941 100.0 

2941 100.0 

2941 100.0 



556 2 AAW71738 

556 2 AAY27055 

556 3 AAB2B445 

556 3 AAB28446 

556 3 AAY94735 

SS6 6 ABO07176 

556 6 ABU0470B 



Aaw7173B Human 3-p 
Aay27055 Human pro 
Aab2B445 Human PDK 
Aab28446 Human PDK 
Aay94735 Phosphoin 
Abo07176 Human p53 
Abu04708 Human exp 













* 


ABU04718 


auuwi no numan exp 




2941 


100 




556 




ABU04712 


AMvKii numan exp 


10 


2941 


100 








ABU04716 


wwi i io numan exp 




2941 


100 




556 




ABU04711 


Abu 04711 Human exp 


12 


2941 


100 




556 


6 


ABU04706 


Abu 04 706 Human exp 






100 


0 




$ 


ABU04714 


Abu04714 Hunn exp 


14 


2941 


100 


, o 


556 


6 


A8U04707 


Abu04707 Human exp 


15 


2941 


100 


,0 


556 


6 


ABU04717 


Abu 04 717 Human exp 


16 


2941 


100 


. o 


556 


7 


ABM79012 


Abm79012 Human pho 


17 








556 


7 


ADD44919 


Add44919 Human Pro 


IB 


2941 


100 , 


. 0 


556 


7 


ADD44915 


Add449l5 Human Pro 










556 


7 


ADD899B3 


Add8 9983 Human can 


20 


2941 


100 


0 


556 


8 


ADI36055 


Adi36055 Human pho 


21 


2941 


100. 


, 0 


556 


6 


AD01546S 


Adol5465 Human PDP 


22 


2941 


100 . 


, o 


556 


B 


ADQ19234 


Adql9234 Human not 


23 


2937 


99. 


. 9 


556 


2 


AAY0S779 




24 


2889. 5 


98. 


, 2 


599 


8 


ABM84519 


Abm84519 Human dia 


25 


2638 


96 . 


. 5 


535 


4 


AAB99823 


Aab99823 AGC prote 


26 


2838 


96. 


. 5 


535 


6 


ABU04713 


Abu04713 Human exp 


27 


2787. 5 


94 . 


a 


559 


7 


ADD44917 


Add44917 Rat Prote 


28 


2787. 5 


94 . 


. g 


559 


7 


ADD44913 


Add44913 Rat Prote 


29 


2682 


91 . 


, 2 


S06 


2 


AAY05780 


Aay05780 Human pro 


30 


2682 


91 , 


, 2 


506 


6 


ABU04715 


Abu04715 Human exp 








2 


462 


10 


AEF60044 


Aef 60044 Human tru 


32 


2202*5 


74 , 




468 


6 


ABU04719 




33 


2202 5 










ABU0470S 


Abu04705 Human exp 




1B93 


64 




361 


9 


ADY8553S 


AdyB5535 Catalytic 


35 


1747 


59. 


4 


335 


4 


AAB99847 


Aab99847 AGC prote 


36 


1747 


59. 


4 


335 


8 


ADJ3B895 


Adj38895 PDfCl amin 


37 


1710 


58. 


1 


319 


6 


ABU04709 


Abu04709 Human exp 


38 


1710 


58. 


1 


319 


6 


ABU04720 


Abu04720 Human exp 


39 


1513 


51. 


4 


289 


B 


ADJ3886S 


Adj38S65 PDK1 amin 


40 


1486 


50. 


5 


285 


6 


ABRS7461 


AbrS7461 AGC famil 


41 


1144. S 


38. 


9 


752 


4 


ABBS7927 


Abb57927 Drosophil 


42 


1144.5 


38. 


9 


752 


4 


ABB67127 


Abb67137 Drosophil 


43 


1144.5 


38. 


9 


752 


4 


ABBS77B2 


Abb577B2 Drosophil 


44 


1144.5 


38. 


9 


755 


4 


ABB67126 


Abb67126 Drosophil 


45 


973 


33. 


1 


S39 


4 


ABB67129 


Abb67129 Drosophil 



ALIGNMENTS 

RESULT 1 
AAW71738 

ID AAW71738 standard; protein; 556 AA. 
XX 

AC AAW7173B; 
XX 

DT 10-DBC-1998 (first entry) 
XX 

DB Human 3 -phosphoinosi tide-dependent protein kinase. 
XX 

KW Protein kinase B-alpha; 3 -phosphoinosi tide -dependent protein kinase; 
KM diabetes; cancer; cell proliferation; phosphorylation. 



Homo sapiens. 



W09841638-A1. 
24-SEP-199B. 

16-MAR-199B; 98WO-GB000777 . 



17-MAR-1997; 
19-JUN-1997; 
15 -AUG- 1997; 
03-OCT-1997; 



97GB-00005462. 
97GB-O0012B26. 
97GB-O0017253. 
97US-00943667. 



(MEDI-) MEDICAL RES COUNCIL. 



OR WPI; 1998-531572/45. 

DR N-PSDB; AAV61037. 
XX 

PT New isolated 3 -phosphoinosi tide -dependent protein kinase - which 

PT phosphorylatea and activates protein kinase B-alpha, used to develop 

PT products for treating diabetes or cancers or for enhancing cell 

PT proliferation. 

XX 

PS Example 2; Pig 10; 120pp; English. 
XX 

CC A pure 3 -phosphoinosi tide- dependent protein kinase (3PDPK) that 

CC phosphorylates and activates PK B-alpha has been isolated. The present 

CC sequence represents human 3PDPK. Products from the present invention 

CC (e.g. 3PDPK, nucleotide sequence encoding 3PDPK, antibodies against 

CC 3PDPK) can be used to identify compounds which modulate the PK activity 

CC e.g. for treating diabetes or cancers or for enhancing cell proliferation 

CC in the regeneration of nerves or in wound healing 

XX 

SQ Sequence 556 AA; 

Query Match 100.0%; Score 2941; DB 2; Length 556; 

Best Local Similarity 100.0%; Pred. No. 2.3e-249; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



iiiiiiiiHiiMiiiniiiiiiiiiiiit iiiiiinii him 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiin 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiinii 



IIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIII Illlllllllll 

CTRPYTABIVSALBYUiGRQI IHRDLKPENILLKKDMHI0ITDFGTAKVLSPESKQARA1J 240 
241 S PVCTAQYVS PBLLTSKSACKSSDLMALGC 1 1 YQLVAGL P PFRAGNEYL I FQKI I KLBYD 300 





l 


Db 


l 


oy 


61 


Db 


61 


oy 


121 


Db 


121 


Oy 


181 


Db 


181 


oy 


241 



Db 241 SFVOTAQWSPEU/TE KSACKSSDLWALGCI I YQLVAGLPPFRAGNBYLI FQK1 I KLBYD 300 

Oy 301 FPBKFFP KARDLVE KLLVLDATKRLOCE EMEGYG P LKAH P F FKSVTWENLH OQTP P KLT A 360 

niimiimiiiiiiiiiiiiiiimiiiimiiiiiiiiiiiiiiiMiiiiM 

Db 301 FPEKFFP KARDLVE KLLVLDAT KRLG CE EMEGYG P LKAH P FFRSVTWENLHQQT PP KLTA 360 

Oy 361 YLPAMSEDDEDCYGNYDNLLSQFGCMQVS SSSSSHS LSASDTGLPQRSGSN I EQY I HDLD 420 

IIIIIIIIIIMIIItlllMIIMIIIIIIMIIII IIIIIMIIIIIIIIMIIIMI 

Db 361 YLPAMSEDDEDCYGNYDNLLSQ FG CMQVSS S S SSHS LSASDTGLPQRSGSN I EOY I HD LD 420 

Oy 421 SNSFELDLQFSEDEKRLLLBKQAGGNPWHQFVENNL I LKMGPVDKRHGLPARRRQLLLTE 480 

lllilllllllllllllll Illlllllllllllllllllllllllllllllllll 

Db 421 SNSPBLDLOFSBDSKRLLLEKQAGGNPWHQFVENNL I LKMGPVDKRKGLFARRRQLLLTS 460 

Oy 4B1 G PHLYYVDPVNKVL KGEIPWSQSLRP EATOiFICTF FVHTPNRTYYLMD PSGNAH KWCRKI 0 540 

MIMIII IIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIilllllll llllll 

Db 481 G PHLYYVDPVNKVL KG E I PWSQB LRP EAKNF KTF FVHTPNRTYYLMD PSGNAHKWCR K I Q 540 

Oy 541 EVWRQRYQSH PDAAVQ 556 

llllllllllllltll 
Db 541 EVWRQRYQSHPDAAVQ 556 



RESULT 2 
AAY27055 

ID AAY27055 standard; protein; 556 AA. 
XX 

AC AAY270S5; 
XX 

DT OB-OCT-1999 (first entry) 
XX 

DB Human protein kinase (HPKM) -4 (clone ID 472480). 
XX 

KW Human protein kinase molecule; HPKM; human; protein kinase; 

KW phosphate group; cancer; immune disorder. 

XX 

OS Homo sapiens. 
XX 

PN W09938981-A2. 
XX 

PD 05-AUO-1999. 
XX 

PF 12 -JAN- 1999; 99WO-US000661 . 
XX 

PR 30-JAN-1998; 98US-00016000 . 



( INCY -) INCYTB PHARM INC. 



PI 
PI 
XX 



Bandman 0, Hillman JL, Lai P, Akerblora IB, Shah P, Corley NC; 
Guegler KJ; 

WPI; 1999-479190/40. 
N-PSDB; AAX89B53. 

New human protein kinase molecules useful for treating or preventing 
cancer or an Immune disorder. 



PS Claim 1; Page 66-69; 77pp; English. 
XX 

CC The invention provides human protein kinase molecules (HPKM) (AAY27052- 

CC 57) and nucleic acid sequences (AAX89850-S5) encoding the HPKM 

CC polypeptides respectively. The HPKM polypeptides can be produced 

CC recombinantly by standard recombinant methodology. Protein kinases add 

CC phosphate groups to proteins. HPKM polypeptide or an HPKM antagonist is 

CC used to treat or prevent cancer or an immune disorder. The present 

CC sequence represents the amino acid sequence of HPKM -4 

XX 

SQ Sequence 556 AA; 

Query Match 100.0%; Score 2941; OB 2; Length 556; 

Beat Local Similarity 100.0%; Prod. No. 2.3e-249; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Oy 1 MARTTSOLYCAVPIOSSVVlXZSCTSPSMVRTITreSSTPPGIPGGSROGPAMIXn-AAKPRP 60 

llllllillllllllllllllllllllltllllllllllllllllllllltllllllllt 
Db 1 MARTTSQLYDAVPIQSSVVLCSCPSPSMVRTQTESSTPPG I PGGSRQaPAMDGTAABPRP 60 

Oy 61 GAGSLQHAQPPPQPRKKRPEDFK^KIL^EGSFSTVVIARSLATSREYAI KILEKRHI I K 120 

MIIIIIIIIIIMMIIMIMMIMMMIIIIMIIMMIMIIIIMIIIIMI 
Db 61 GAGSLQHAQ PPPQ PR KKRPEDFKFG K I LC BGS FSTWLARBLATSRBYA I KX LE KRH 1 1 K 120 

Qy 121 ENKVPYVTRERDVMSRLDH P FFVKL Y PTFQDD 5 KLY FGLSYAKNGE LLKY I RK I G S FD ET 160 

HIM I III lllllll IMI III III I IIMII I INIIMII I IIMII llll III I II 

□b 121 ENKVPY^RERDVMSRI^HPFFVKXVFTFQDDBKXYFGLSYAKNGBIXKYIRKIGSFDST 1B0 

Oy 181 CTRFYTABIVSAI^YLKGKGI IHRDLKPKNILLhreDWIQITDFGTAKVLSPBSKXlARAN 240 

IIIIIIIIIIIIIIHIIIIIIIMIIIIIIIIIIII III I II I III III I III III I II 

Db 181 CTRPYTAB I VSALBYLHGKG 1 1 HRDLKPENI LLNEIJMHIGITDF^ 240 

Oy 241 SFVGTAQYVSPBLLTEKSACKSSDLWALGCI I YQLVAGLPPFRAGNEYLI FQKI I KLBYD 300 

II llllllllll lllllll MM 111 I IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

Db 241 SFVGTAQYVSPELLTEKSACKSSDLWALGCI I YQLVAGLPPFRAGNEYLI FQKI I KLBYD 300 

Oy 301 F PE KFFPKARDLVE KLLVLDAT KRLGCEEMEG YG PL KAH P PFE SVTWENLHQQT P P KLTA 360 

llllllllllllltlllllilll Illlllllllllllllllllllllllllllll 

Db 301 F PBKF FPKARDLVB KLLVLDAT KRLG CE EMEGYG PL KAH P FFESVTWENLHQQTP P KLTA 360 

Oy 361 YLPAMSEDDEDCYGNYDNLLSQFGCMQVS S SS SSHSLSASDTQ LPQRSGS N I EQY I HD LD 420 

Mllllllllllllllilllllll tltllllltlltlillillllllllltllll 

Ob 361 YLPAMSEDDEDCYG^m)KLLSQFGCMaVSSSSSSHSLSASiyrGLPaRSGSNIEOYIHDLD 420 

Oy 421 SNSFELDLQFSEDSKRLLLEKQAGGNPWHQFVENNLILKMGPVDKRKGLFARRRQLLLTE 480 

II III lllllll lllllll I III III I III III MM III I III III III I III III I II 

Db 421 SNS FELDLQFSED SKRLLLE KGAGGNPWHQFVENNL I LKMG PVD KRKG LFARRRQLLLTE 480 

Oy 481 GPHLYYVDPVNKVLKGE I PWSQELRPEAKNFKTPFVHTPNRTYYLMDPSGNAHKWCRKI Q 540 

IMMMIMM I IIMII I III MM III III MM III 1 1 1 1 1 M 1 1 1 1 1 M 1 1 M 1 1 

Db 481 GPH LYYVD PVNKVLKG E I PHSQELRPEAKKFKT F FVHTPNRTYYLMD PSGKAH KWCRK I Q 540 

Oy 541 BVWRQRYQSH PDAA VQ 556 

llltllltlllltlll 
Db 541 BVWRQRYQSH PDAAVQ 556 



RESULT 3 
AAB28445 

ID AAB28445 standard; protein; 556 AA. 
XX 

AC AAB28445; 
XX 

DT 01-FBB-2001 (first entry) 
XX 

0B Human PDK-1. 
XX 

KM Human; PDK-1; 3-phosphoinositide dependent protein kinase* 1; 

KM antisenae oligonucleotide; phosphorothioate; antiinflammatory; 

KM cytostatic; antimicrobial. 



OS 
XX 



Homo sapiens. 
US6 124 272 -A. 



26-SBP-2000. 



PF 09-APR-1999; 99US-002B9466 . 
XX 

PR 09-APR-1999; 99US-00289466 . 
XX 

PA (ISIS-) ISIS PHARM INC. 
XX 

PI Monia BP, Cowsert LM; 
XX 

DR WPI; 2000-611015/58. 

DR N-PSDB; AAC60590. 
XX 

PT Novel antisense conpounds useful for inhibiting the expression of human 3 

PT -phosphoinositide dependent protein kinase-1, useful e.g. for treating 

PT inflammation, tumors and infections. 
XX 

PS Disclosure; Col 4S-4B; 41pp; English. 
XX 

CC The nucleotide sequence encoding the present protein was used to design a 

CC large number of antisense oligonucleotides which are targeted to a 

CC nucleic acid molecule encoding human 3-phosphoinositide dependent protein 

CC kinase-1 (PDK-1). The antisense compounds may be oligodeoxynucleotides or 

CC chimeric oligonucleotides containing a central gap region, consisting of 

CC ten 2 • -deoxynucleotides, which ia flanked on both sides by 2'- 

CC methoxyethyl (2' -MOB) wings. The oligonucleotides have a phosphorothioate 

CC backbone. The antisense oligonucleotides are useful for inhibiting the 

CC expression of human PDK-1 in human cells or tissues. They are also useful 

CC for preventing or delaying infection, inflammation or tumours and are 

CC useful for research and diagnostics 

XX 

SQ Sequence 556 AA; 

Query Match 100.0%; Score 2941; DB 3; Length 556; 
Best Local Similarity 100.0%; Pred. No. 2.3e-249; 

Ketches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Oy 1 MARTTSQL YOAVP I QSSWLCS CP S PSMVRTQTES STP PG I PGGSRQGPAMDGTAAB P RP 60 



IIIIIIIIIIIMMIIIMIMIIIII MIMIIIIIMIIMI IIMIIMIMIIM 

Db 1 MARTTSQL YDAVP I QSSWLCS CP S PSMVRTQTESSTP PG I PGGS RQGPAMDGTAAEPRP 60 

Qy 61 GAGSLQHAOPPPQPRKKRPEDFKFGKI LGEGSFSTVVLARELATSREYAI KILEKRH 1 1 K 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 f 1 1 1 1 j 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 GAGSLQHAOPPPQPRKKRPEDFKFGKILGEGSFSTVVLARELATSREYAIKILEKRHIIK 120 

Qy 121 EMKVPYVTRERDVMSRLDH PF FVKLY FTFQDDE KLY FG LSYAKMG B LLKY IRKIGSFDET 180 

IIIIMM lllllll IMIIMIII IIMII IMIIIMilMIMIMIIMIMIIM 

Db 121 ENKVPYVTRE RDVMS RLDH P FFVKLYFTFQDDS KLY FGLSYAKNGE LLKY I RKI GSFDBT 180 

Qy 181 CTRFYTAE I VSALBYLHGKG 1 1 HRDLKPENI LLNEDMH I QI TDFGTAKVLS PES KGARAN 240 

IIIIMMMIIIIIIMIIIIIIIIIIMIIIIIMIIIMII IIMMMIMIIMI 

Db 181 CTRFYTAE I VSALBYLHGKG 1 1 HRDLKPENI LLNEDMH IQI TDFGTAKVLS PBSKQARAN 240 

Oy 241 SFVGTAQYVSPELLTEKSACKSSDLWALGCI I YQLVAGLPPFRAGNEYLI FQKI I KLBYD 300 

MIMMMMMMMMMMMMMMMIMMMMM MMMIMMMMI 

Db 241 SFVGTAQYVSPBLLTEKSACKSSDLHALGCI I YQLVAGLPPFRAGNEYLI FQKI I KLBYD 300 

Qy 301 FPEKFPPKARDLVEKLLVIJ1ATKRLGCEEMEGYGPLKAHPFFBSVTWENLHQQTPPKLTA 360 

IIIIMIMIMIIIIMIIIIIIIMIMI IIIMIMMIIMMMIIMMIIMI 

Db 301 FPBKFFPlGUu^LVBlG^Lvl/MTKRI^CEEMEGYGPLKAHPFFESVTWSNLHQQTPPKLTA 360 

lb !!! 111111111^^ «o 

Qy 421 SNSFELDLQFSBDE KRLLLB KQAGGNPWHQFVENNL I L KMG PVD KRKGLFARRRQLLLTE 480 

Mill IIIIMM III III I III III llll III I lllllll III IIMIIIIMMI Ml 

Db 421 SNSFELDLQFSBDE KRLLLE KQAGGNPWH Q FVENNL I LKMGPVD KRKGLFARRRQLLLTE 480 

Oy 481 GPHLYYVD PVNKVLKGE I PWSQBLRP BAKNF KT F FVHT PNRTYYIUDPSGNAH KWCRKI Q 540 

MMIMMMMM 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 481 GPHLYYVDPVNKVLKGKI PWSQELRPEAXNFKTFFVHTPNHTYYLMDPSGt-IAH KWCRKI Q 540 

Oy 541 BVWRQRYQSH PDAAVQ 556 

I I I I t I I t I I I I I I I I 
Db 541 BVWRQRYQSH PDAAVQ 556 



RESULT 4 
AAB28446 

ID AAB28446 standard; protein; 556 AA. 



AAB2S446; 
01-FEB-2001 
Human PDK-1 . 



(first entry) 



PDK-1; 3-phosphoinositide dependent protein kinase-1; 
antisense oligonucleotide; phosphorothioate; antiinf laamatory; 
cytostatic; antimicrobial. 

Homo sapiens. 

US6124272-A. 



26-SEP-2000. 



09-APR-1999; 99US-00289466 . 



PR 09-APR-1999; 99US-002B9466 . 
XX 

PA (ISIS-) ISIS PHARM INC. 



Monia BP, Cowsert LM; 



WPI; 2000-611015/58. 
N-PSDB; AAC60591. 



PT Novel antisense compounds useful for inhibiting the expression of human 3 

PT -phosphoinoaitide dependent protein kinase-1, useful e.g. for treating 

PT inflammation, tumors and infections. 
XX 

PS Disclosure; Col 49-52; 41pp; English. 
XX 

CC The nucleotide sequence encoding the present protein was uaed to design a 

CC large number of antisense oligonucleotides which are targeted to a 

CC nucleic acid molecule encoding human 3-phosphoinositide dependent protein 

CC kinaae-1 (PDK-1). The antiaenae compounds may be oligodeoxynucleotides or 

CC chimeric oligonucleotides containing a central gap region, consisting of 

CC ten 2' -deoxynucleotides, which is flanked on both sides by 2 ' - 

CC methoxyethyl (2* -MOB) wings. The oligonucleotides have a phosphorothioate 

CC backbone. The antisense oligonucleotides are uaeful for inhibiting the 

CC expression of human PDK-1 in human cells or tissues. They ere also useful 

CC for preventing or delaying infection, inflammation or tumours and are 

CC useful for research and diagnoatics 

XX 

SQ Sequence 556 AA; 

Query Match 100.0%; 
Best Local Similarity 100.0%; 
Matches 556; Conservative 0 



Score 2941; DB 3; Length 556; 
Pred. No. 2.3e-249; 

Mismatches 0; Indels 0; 



oy 


1 


Db 


1 


Oy 


61 


Db 


61 


Oy 


121 


Db 


121 


Oy 


181 


Db 


181 


Oy 


241 


Db 


241 



1 1 M 1 1 M I II 1 1 II 1 1 1 1 1 1 1 M 1 1 M I M J 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 U 1 1 1 1 1 1 



iiimiiiMmiiiiiimiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiii 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 M 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiii 



Oy 
□b 

oy 
□b 
oy 

Db 
Oy 
Db 



301 FPKKFFPKAJU)LVKKXLVU^TKRLGC^S>^YGPLK3UiPFreSVTWEm^QOTPP»CLTA 360 

IIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIllllllllllllll 
301 FPB KFFP KARD L VB KLLVLDAT KRLGCEEMEOYO PLKAH PP PBSVTWENLHQQT P P KLTA 360 

361 YLPAMSEDDEDCYGNYDNLLSQFGCHOV SSSSSSHSLSASDTGLPQRSGSN I BOY I KDLD 430 

llllllllllillllllllillllllllllllllllltllllllllllllltllllllll 
361 YLPAKSEDEEIX^GNYDNIXSOFGO^SSSSSSHSl^ASimJLPQRSGSNIEQyiHDLD 430 

431 SNSFBLDLiQFSKDBKRLLLBKOAGGNPWHQFVENNL I LK*G PVDKRKGLFARRRQLLLTB 4 BO 

i nun i im i iii i iniiii nii iii mi i iii i iiiiiiiiiiiiii i iiiiiii 

431 SNSFELDLQFSEDB KRLLLE KQAGGNPWH Q FVENNL I L KMG PVD KR KG L F ARRRQ LLLTE 460 

481 G PHLYYVD PVNKVL KDE I PWSQELRPEAKNFKT F PVHT PNRTYYUffi PSGNAH KMCRKI Q 540 

Iflllllllllllllltllllllllllllillllllllllllllllllllllllllllll 
481 G PHLYYVD PVNKVL KG B I PWSQE LR P EAKNF KTF PVHTPNRTYYLMD PSGNAH KW CR K I Q 540 

541 BVWRQRYQSH PDAAVQ 556 

llllllllllllllll 
541 EVWRQRYQSH PDAAVQ 556 



RESULT 5 
AAY94735 

ID AAY94735 etandard; protein; 556 AA. 
XX 

AC AAY94735; 
XX 

DT 29-JAN-2001 (firat entry) 
XX 

DB Phoaphoinoaitide dependent protein kinaae 1 (PDK1) . 
XX 

KM Substrata apecificity; phoaphoinoaitide -dependent protein kinase 1; P0K1; 

KM protein kinaae C related protein kinase 2; PRK2 ; cancer; apoptoaia; 

KM mechanical tieaue damage; iachaeraic disease; stroke; 

KM myocardial infarction; antigenic peptide. 

XX 

OS Unidentified. 
XX 

PN WO200056864-A2. 
XX 

PD 28-SEP-2000. 
XX 

PP 17-MAR-2000; 2000WO-GBQ01004 . 
XX 

PR 19-MAR-1999; 99GB-00006245 . 
XX 

PA (UYDU-) UNIV DUNDEE. 
XX 

PI Aleaai D, Balendran A, Deak M, 
XX 

DR WPI; 2000-647155/62. 



Currie R, Downea P, Caaamayor A; 



Diacloaure; Fig 10; 103pp; English. 

This invention relates to a method for altering the substrate apecificity 
of phoaphoinoaitide-dependent protein kinase 1 (PDK1) , by exposing it to 
an interacting polypeptide. Included in the invention are a preparation 
comprising PDK1 and an interacting polypeptide. PDK1 with altered 
apecificity ia uaaful for phoaphorylating a residue corresponding to the 
Ser/Thr residue of a substrate with the following peptide Phe/Tyr-Xaa-Xaa 
-Phe/Tyr-Ser/Thr-Phe/Tyr. Altered PDK1 ia also used for phoaphorylating 
protein kinaae C related protein kinaae 2 (PRK2) . The compound identified 
by methoda of the invention that are capable of altering the eubetrete 
specificity of PDK1 are uaeful for manufacturing a medicament for 
treating a patient who ia in need of modulation of the insulin signalling 
pathway and/or PDK1, PDK2 or PRK2 signalling . A caopound that ia capable 
of reducing the activity (i.e. the PDK1 and/or the PDK2 activity) of PDK1 
may be uaeful in treating cancer. PDK1, e.g. via protein kinaae B and/or 
SGK, may be capable of providing a aurvival aignal that protects cells 
from apoptoaia induced in a variety of waya. Reduction of the activity of 
PDK1 may promote apoptoaia and may be useful in treating cancer. 
Conditions in which aiding apoptosis may be benefit may alao include 
resolution of inflammation. A compound capable of increasing the activity 
of PDX1 may be uaeful in treating diabetes or obeaity, or may be uaeful 
in inhibiting apoptoaia. Increased activity of PDK1 may lead to increased 
levels of leptin, which may lead to weight loaa. The compounda may 
auppresa apoptoaia, which may aid cell aurvival during or following cell 
damaging proceeeee and in treating diseaae in which apoptoaia ia 
involved. Examples of the diaeaaea Include, mechanical (including heat) 
tissue injury or ischaemic disease, for example atroke and myocardial 
infarction, or neural injury. The present sequence represents a 
phosphoinositide-dependent-protein kinase 1 amino acid sequence 

Sequence 556 AA; 



Query Hatch 100.0%; Score 2941; DB 3; Length 556; 

Beat Local Similarity 100.0%; Pred. No. 2.3e-249; 

Matches 556; Conservative 0; Mismatches 0; Indole 0; 



Oy 
Db 
oy 



Altering substrate apecificity of phoaphoinoaitide-dependent protein 
kinaae 1, to phoaphorylate Ser473 in addition to Thr30 8 by exposing to 
interacting polypeptide. 



Oy 
Db 



1 MARTTSQLYDAVP IQSSVVLCSCPSPSMVRTQTESSTPPG I PGGSROGPAKDGTAAEPRP 60 

1 llUTTSQLY^ 60 

61 GM SLQHAQ P P PQPRKKRP EDP KFG K I LG EGS FSTVVLARELATSREYAI ^ j ^ j j "0 

61 GAGSLQHAQPPPQPRKKRPEDFKFGKI LGEGSFSTWLARELATSREYAI KI LEKRHI I K 120 

121 ENKVPYVTRBRDVMSRU3HPFFVKLYFTFQDDBKLYFGL^AX^ELLKy I RKIGSFDET 1B0 

IMIIIIIIIIIItlllllllllllllllltllllllllllllllllltlllllllllll 
121 BNKVPYVTRERDvXSRLDHPFFvTO,YFTFQDDBKLYFGLSYAKNGELLKYI RKIGSFDBT 180 

181 CTRFYTAEIVSALEYLHGKGI IHRDLKPENI LXWEDKHIQITDFGTAKVLSPESKOARAN 240 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
181 CTRFYTABIVSALSYLKGKGI IHRDLKPENII^JJEDKHIQITDFGTAKVLSPBSKQARAN 240 

241 SFVGTAQYV5PELLTBKSACKSSDLHALGCI I YOLVAGLPPFRAGNEYLI FQKI I KLEYD 300 

IIMIIIIillllfllllllllllilllllllllllllllllllllllllllllllllll 
241 SFVGTAQYVSPELLTEKSACKSSDLHALGCI I YOLVAGLPPFRAGNEYLI FQKI I KLEYD 300 



oy 



oy 
Db 



301 FPBKFFPKARDLVEKLLVLDATKRLGCEEMECTGPLKAHPFFESVTW^ 360 

llltllllllllliltlllflllllltlllllllllllllllllltllltilllllllll 
301 FPE KFFPKARDLVE KLLVLDATKRLGCEBMBGYG P LKAH P FFBSVTWBNLHQOTP P KLTA 360 

361 YL PAMSEDDBDCYGNYDNLLSQ FGCMQV S S S SS SHSLSASDTGL PQRSGSN I E0Y I HOLD 420 

iiiiiiiiiiiiiiiiiiiiiiiiiiiimiiimiiiiiiiiiiMiiiiiiiiiii 

361 YLPAKSEDDEDCYGNYDNLLSQFGCMQVSSSSSSHSLSASDTGLPQRSGSN I EQY I HDLD 420 

421 SNSFELDLQFSEDE KRLLLE KQAGGNPWHQFVENNL I LKKG PVD KR KG L FARRRQLLLTE 480 

IlilllllllllllllllllllltllllllllllMttllllllllllllllllllllll 
421 SNSFELDLQF5EDE KRLLLE KQAGGNPWHQFVENNL I LKMG PVD KRKGL FARRRQLLLTE 480 

481 G PHLYYVD PVNKVL KGB 1 PMSQE LRPEAKNFKTFFVHTPNRTYYLilDPSGNAH KMCR KI 0 540 

IMIIIIIIIIIIIIIIIillllllllllllillllllllllllllllllMllllllli 
481 G PHLYYVD PVNKVL KGE I PWSQELRPEAKNFKTFFVHTPNRTYYLMDPSGNAHKMCRKIQ 540 

541 BVWRQRYQSHPDAAVO 556 

llllllllllllllll 
541 EVWRQRYQSH PDAAVQ 556 



RESULT 6 
ABO07176 

ID ABO07176 standard; protein; 556 AA. 
XX 

AC AB007176; 
XX 

DT 13-AUG-3003 (firat entry) 
XX 

DB Human p53 modifying protein, SEQ ID 136. 
XX 

KM Human; p53 modifier; cytostatic; cancer; cytoatatic; antiangiogenic; 
KM antiapoptotic; pS3 pathway; breast cancer; colon cancer; kidney cancer; 
KM lung cancer; ovarian cancer; angiogeneeie; cell cycle; 
KM apoptotic disorder; cell proliferation disorder. 
XX 

OS Homo sapiens. 
XX 

PN WO200299122-A1. 
XX 

PD 12-DEC-2002. 
XX 

PF 03-JUN-2002; 2002KO-US017382 . 
XX 

PR 05-JUN-2001; 2001U5-0296076P. 
PR 10-OCT-2001; 2001U5-0328605P. 
PR 15-FBB-2002; 2002US-03S7253P. 
XX 

PA (BXBL-) BXBLIXIS INC. 
XX 

PI Friedman L, Plowman GD, Belvin H, Francis -Lang H, Li D, Funke RP; 
XX 

DR WPI; 2003-156B59/15. 
DR N-PSOB; ACD133 52. 
XX 

PT Identifying raodulatora of the pS3 pathway for uae in treating apoptotic 
PT or cell proliferation disorders, comprises screening for agenta that 



PT modulate activity of a human ortholog of genea that modify the p53 

PT pathway in Droaophila. 

XX 

PS Example 2; Page 439-440; 67Bpp; English. 
XX 

CC The invention relates to identifying (Ml) a candidate p53 pathway 

CC modulating agent, by contacting an aaaay ayatem comprising a purified HM 

CC polypeptide (human orthologue of genea that modify the p53 pathway in 

CC Droaophila) or nucleic acid with a teat agent under conditions, where but 

CC for the presence of the test agent, the ayatem provides a reference 

CC activity, and detecting a teat agent -biaaed activity of the aaaay ayatem. 

CC Alao included are modulating (M2) a pS3 pathway of a cell (comprising 

CC contacting a cell defective in p53 function with a candidate modulator 

CC that apecifically binds to a HM polypeptide comprising an HM amino acid 

CC sequence, where p53 function ia restored), modulating (M3) a p53 pathway 

CC in a mammalian cell (comprieing contacting the cell with an agent that 

CC apecifically binds an HM polypeptide or nucleic ecid) and diagnosing (M4) 

CC a diaeaae in a patient (comprising: (a) obtaining a biological sample 

CC from the patient; (b) contacting the sample with a probe for KM 

CC expreaaion; (c) comparing the results with a control; and (d) determining 

CC whether the comparison indicates a likelihood diaeaae) . (Ml) ia useful 

CC for identifying modulators of the p53 pathway. A probe for HM expreaaion 

CC ia useful for diagnosing breaat, colon, kidney, lung and ovarian cancer, 

CC in a patient, where the cancer haa greater than 25 % expreaaion level. 

CC Modulators identified by (Ml) are useful in a variety of diagnostic and 

CC therapeutic applications, where diaeaae or disorder prognosis ia related 

CC to defecta in the p53 pathway, auch ae, angiogeneals, apoptotic or cell 

CC proliferation disorders (e.g. cancer). Another two new methoda (M2 and 

CC M3) are uaeful for modulating the pS3 pathway of a cell, thus restoring 

CC the p53 function of the cell, ao that the cell undergoes normal 

CC proliferation or progression through the cell cycle. (M2) and (M3) are 

CC also uaeful for treating defecta in the pS3 pathway such as angiogenic, 

CC apoptotic or cell proliferation disorders. The present sequence 

CC represents a human p53 pathway modifying protein 

XX 

SQ Sequence 556 AA; 

Query Match 100.0%; Score 2941; DB 6; Length 556; 

Beat Local Similarity 100.0%; Pred. No. 2.3e-249; 

Matchea 556; Conaervativo 0; Mismatches 0; Indele 0; Gaps 0; 



iiiiiii iii i Mini Miii iii im iiiiiiiiM i iiiiiii iiiiiii linn 



nillMIMIIIIIIMIIIIIIIIIIIIIIIIIMtlMIIIIIIIIIIIIMIIIII 



IKIIIIIIIHI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 1 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

~" * ./YFTFQDD8 KLY FGLSYAKNGELLKY I RKIGSFDBT 180 



M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 



oy 


1 


Db 


1 


Oy 


61 


Db 


61 


oy 


121 


Db 


121 


oy 


181 


Db 


181 



oy 


241 


Db 


341 


oy 


301 


Db 


301 


Oy 


361 


Db 


361 


oy 


421 


Db 


421 


oy 


4B1 


Db 


4B1 


Oy 


S41 


Db 


541 



lilllllllllllllllllllllltlillllllllllllllllllllllllll 



Mil 



llll I IMIIIIIMIIIIIIIII I llllll IIMMII1IIIIIIIIIII IIMIIIII 



llllllllllllllllllllllilllllllllllllllllllllllllllllllllllll 



lllllllllllllllllllllillllllllllllllllllllllllllllllllllllll 



lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



IIIIIIIIIIMIIII 



RESULT 7 
ABU04708 

ID ABU04708 standard; protein; 556 AA. 
XX 

AC ABU0470B; 
XX 

DT 29-JAN-2003 (first entry) 
XX 

DB Human expressed protein tag (BPT) 01374. 
XX 

KW Tranalational profiling; expressed protein tag; EPT; kinase; phosphatase; 

KW protease; protease inhibitor; transporter; cytoskeletal protein; 

KW receptor; transcription factor; cancer; KHC; 

KW major histocompatability complex; myeloma; colon cancer; gastric cancer; 

KW adenocarcinoma; sarcoma; me la; 
XX 

OS Homo sapiens. 
XX 

PN WO2O0278S24-A2. 
XX 

PD 10-OCT-2002. 
XX 

PF 28-MAR-2002; 2 002HO-US009671 . 
XX 

PR 28-MAR-2001; 2001US-0279495P . 

PR 31-MAY-2001; 2001US-0292544P. 

PR 08-AUG-2001; 2001US-0310801P . 

PR 01-OCT-2001; 2001US-0326370P. 

PR 04-DBC-2001; 2001US-0336780P . 

PR 20-FEB-2002; 3002US-03SB985P . 
XX 

PA (2YCO-) ZYCOS INC. 



PI Chicz RM, Total inaon AJ, Urban RO; 
XX 

DR WPI; 2003-040607/03. 



New polypeptides (e.g. kinases, phosphatases, proteases, transporters, 
cytoskeletal proteins, receptors or transcription factors), useful for 
treating cancer, e.g. colon cancer, gastric cancer, aarcoma, lymphoma or 
leukemia. 



PT 
PT 
PT 
XX 

PS Example 2; SBO ID NO 1374; 134pp; Englieh. 
XX 

CC The invention describes a purified polypeptide, which comprises a 

CC fragment of a kinase, phosphatase, protease, protease inhibitor, 

CC transporter, cytoskeletal protein, receptor or transcription factor. The 

CC polypeptide is useful as an immunogenic composition for eliciting in a 

CC mammal an immunogenic response directed againat any of the purified 

CC polypeptide. The purified polypeptide, or the antibody that binds to this 

CC polypeptide, is useful for treating cancer. The polypeptide is also 

CC useful for identifying compounds that binds to a naturally processed 

CC class I or class II KHC- binding polypeptide. The polypeptides and 

CC polynucleotides are particularly useful for treating or preventing 

CC myeloma, colon cancer, gastric cancer, adenocarcinoma, sarcoma, melanoma, 

CC lymphoma or leukaemia. These are also useful for screening agents for 

CC treating the above mentioned diseases. This sequence represents an 

CC expressed protein tag (EPT) isolated from human tissue for tranalational 

CC profiling. Note: Thia sequence does not appear in the printed 

CC specification but was obtained in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/publiehed _pctj 
XX 

SO Sequence 556 AA; 

Query Match 100.0%; 
Beat Local Similarity 100.01; 
Hatches 556; Conservative 0 



Score 3941; DB 6; Length 556; 
Pred. No. 2.3e-249; 

Mismatches 0; Indels 0 



Oy 


l 


Db 


l 


Oy 


61 


Db 


61 


Oy 


121 


Db 


121 


Oy 


181 


Db 


181 


Oy 


241 


Db 


241 


Oy 


301 



IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 



liiiiliiiiiiiiiiiiiiiiiiilliliillilllllilillililliiiiiiinii 



I II llll III I MUM UN II I III III I Ml III MM III IMIM MM MM II 



MIMIII IIIIIIIIIIMIIIIIIMIMIIIIIMIIMMIIIIIIIIMIIIIII 



301 PPBKFFPKARDLTCKLLVLDATKRIX>CBEMEGYGPLKAHPFFESVTWENU{0£TPPKLTA 360 



Oy 

Db 

Oy 



oy 

Db 



llllllllllllllllilllMllllllimillllllllllllillllllllllHIl 

301 FPBKFFPKARDLVEKLLVLDATKRLGCEBMBGYGPLKAHPFFESVTWENLHOQTPPKLTA 360 
361 YLPAMSBDDBDCYGNYDtllXSQFGCMOVSSSSSSHSLSASDTGLPQRSGSNIBQYXHDLD 420 

IIMIIllitlllllllllMIIIIIIIIIIIIHIIIIHimillllllllllllll 

361 YLPAMSBDDEDCTGNYDNLLSQFGCKOVSSSSSSHSLSASDTGLPQRSGSNIEQYIHDLD 420 
421 SNSFE LDLOFSEDE KRLLLEKOAGGNPWHQFVBNNL I L KMG PVD KRKGL FARRRQLLLTE 480 

MIMMMMMIMMMMMMMIMMMMMMMIMM IMIMMIMI 

421 SNSFBLDLQFSEDBKRLLLBKQAGGNPWHQFVENNLI LKMGPVDKRKGLFARRRQLLLTE 480 
481 GPHLYYTOPVNKVLKGBIPWSQELRPEAKNFKTFFVTre^ 540 

I MMIIIMM II I llllll MM llllll I Ml Ml III MIMII llll IIMII II 

481 GPHLYYVDPVNKVLKGE I PWSQEIJtPEAKNFKTFFVHTPNRTYYLMDPSGNAHKWCRKIO 540 
541 BVWRQRYQSH PDAAVQ 556 

IIIIIIIIIIMIIII 

541 BVWRQRYQSH PDAAVQ 556 



RESULT 8 
ABU04718 

ID ABU0471B standard; protein; 556 AA. 
XX 

AC ABU04718; 
XX 

DT 29-JAN-2003 (first entry) 
XX 

DB Human expressed protein tag (BPT) #1384. 
XX 

KW Tranalational profiling; expressed protein tag; BPT; kinase 

KW protease; protease inhibitor; transporter; cytoskeletal protein; 

KW receptor; transcription factor; cancer; MHC; 

KW major histocompatability complex; myeloma; colon cancer; gastric 

KW adenocarcinoma; sarcoma; melanoma; lymphoma; leukaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200278524-A3. 



PD 


10 


-OCT 


-2002. 






XX 












PF 


28 


-MAR- 


-2002; 


2002WO-US009671. 


XX 












PR 


28 


-MAR- 


-2001; 


3001US- 


-02794 9SP. 


PR 


21 


-MAY- 


•2001; 


2001US- 


•0393S44P. 


PR 


08 


-AUG- 


•2001; 


2 001 US • 


-0310801P. 


PR 


01 


-OCT- 


•2001; 


2 001 US- 


-0326370P. 


PR 


04 


»DSC- 


-2001; 


2001US- 


•0336780P. 


PR 


20 


-FEB- 


-2002; 


2002US- 


-035898SP. 



XX 
PA 



(ZYCO-) ZYCOS INC. 

Chicz RM, Tomlinson AJ, Urban RO; 



WPI; 2003-040607/03. 



PT New polypeptides (e.g. kinases, phosphatases, proteases, transporters, 

PT cytoskeletal proteina, receptors or transcription factors), useful for 

PT treating cancer, e.g. colon cancer, gaatric cancer, sarcoma, lymphoma or 

PT leukemia. 
XX 

PS Example 2; SEQ ID NO 13B4; 134pp; English. 
XX 

CC The Invention describes a purified polypeptide, which comprises a 

CC fragment of a kinase, phosphatase, protease, protease inhibitor, 

CC transporter, cytoskeletal protein, receptor or transcription factor. The 

CC polypeptide is useful as an immunogenic composition for eliciting in a 

CC mammal an immunogenic response directed against any of the purified 

CC polypeptide. The purified polypeptide, or the antibody that binds to this 

CC polypeptide. Is useful for treating cancer. The polypeptide is also 

CC useful for identifying compounds that binds to a naturally processed 

CC class I or claas II MHC-binding polypeptide. The polypeptides and 

CC polynucleotides are particularly useful for treating or preventing 

CC myeloma, colon cancer, gastric cancer, adenocarcinoma, sarcoma, melanoma, 

CC lymphoma or leukaemia. These are also useful for screening agents for 

CC treating the above mentioned diseases. This sequence represents an 

CC expressed protein tag (EPT) isolated from human tiasue for tranalational 

CC profiling. Note: This sequence does not appear in the printed 

CC specification but was obtsined in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/publiahed_pct_aequencee 

XX 

SQ Sequence S56 AA; 

Query Match 100.0%; Score 2941; DB 6; Length S56; 

Best Local Similarity 100.0%; Pred. No. 2.3e-249; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



I II llll I llllll llllll MIMII I III llll II MM MIMII III III llll II 



II III I III llll II I llllll llll II llll llll I llll 1 1 1 1 1 1 1 1 1 1 1 1 1 llll II 



M I llll II III Mill II llll I Mill III II I III llll II IIIIIIIIIIMIIII 



III IIMIIIII llll II I llll II MM II MM llll III III I Mill II IMIIII 



llllll llllllllllllll II III I III III llll Mill IIIIIM I Mill llll II 



M IMIIII Mill 1 1 llllll I III II I III llll II llll llllllllllll llllll 



II I llll II III llll II I III II Mill III II I III llll III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



oy 


i 


Db 


l 


oy 


61 


Db 


61 


oy 


121 


Db 


121 


oy 


1B1 


Db 


181 


Oy 


241 


Db 


241 


oy 


301 


Db 


301 


oy 


361 



Db 


361 


Oy 


421 


Db 


421 


Oy 


401 


Db 


4B1 


Oy 


541 


Db 


541 



361 YLPAMSEDOEDCYGNYDNU^QFGCMOVSSSSSSHSLSASDTGLPQRSGSNIBQYIHDLD 420 




llllllllllllllltllllllllllllllllllllllilllllllllllllllllllll 



llillllllllllllt 



RESULT 9 
ABU04712 

ID ABU04712 standard; protein; 5S6 AA. 



AC ABU04712; 



DT 29-JAN-2003 (first entry) 
XX 

DE Hunan expressed protein tag (SPT) 91378. 
XX 

KW Translational profiling; expreesed protein tag; BPT; kinaae; phoaphataae; 

KN protease; proteaae inhibitor; transporter; cytoakeletal protein; 

KM receptor; transcription factor; cancer; MHC; 

KW major hiatocompatability complex; myeloma; colon cancer; gaatric cancer; 



OS Homo sapiens. 
XX 

PN WO200278S24-A2. 



PD 


10 


•OCT- 


-2002. 






XX 












PP 


26- 


■MAR- 


•2002; 


2002WO 


-US009671. 


XX 












PR 


2B- 


-MAR- 


•2001; 


2001US- 


•0279495P. 


PR 


21- 


-MAY- 


•2001; 


2001US-0292544P. 


PR 


OB 


-AUG- 


•2001; 


2001US 


-0310801P. 


PR 


01 


OCT- 


•2001; 


2001US- 


-0326370P. 


PR 


04 


-DBC- 


•2001; 


2001US 


■O3367B0P. 


PR 


20 


•FEB- 


•2002; 


2O02US- 


•0358985P. 



(ZYCO-) ZYCOS INC. 



PI Chicz RM , Tomlinaon AJ, Urban RG; 
XX 

DR WPI; 2003-040607/03. 
XX 

PT New polypeptides (e.g. kinaaea. phosphatases , proteases, transporters, 

PT cytoakeletal proteins, receptors or transcription factors), useful for 

PT treating cancer, e.g. colon cancer, gastric cancer, 

PT leukemia. 



PS Example 2; ESQ ID NO 1378; 134pp; English. 
XX 

CC The invention describe* a purified polypeptide, which comprisea a 

CC fragment of a kinase, phosphatase, proteeee, proteaae inhibitor, 

CC transporter , cytoakeletal protein, receptor or transcription factor. The 

CC polypeptide is useful as an immunogenic composition for eliciting in a 

CC mammal an immunogenic response directed against any of the purified 

CC polypeptide. The purified polypeptide, or the antibody that binda to this 

CC polypeptide, is useful for treating cancer. The polypeptide is also 

CC uaeful for identifying compound* that binda to a naturally processed 

CC class I or class II KHC-binding polypeptide. The polypeptide* end 

CC polynucleotides era particularly useful for treating or preventing 

CC myeloma, colon cancer, gastric cancer, adenocarcinoma, sarcoma, melanoma, 

CC lymphoma or leukaemia . These ere also useful for screening egents for 

CC treating the above mentioned diaeases. This sequence represents an 

CC expressed protein tag (BPT) isolated from human tiasue for translational 

CC profiling. Note: This sequence does not appear in the printed 

CC specification but was obtained in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/publiahed _pct_sequences 

XX 

SQ Sequence 556 AA; 

Query Hatch 100.0%; Score 2941; D8 6; Length 556; 

Best Local Similarity 100.0%; Pred. No. 2.3e-249; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Oapa 0; 



Oy 


1 


Db 


1 


Oy 


61 


0b 


61 


Oy 


121 


Db 


121 


Oy 


181 


Db 


181 


Oy 


241 


Db 


241 


Oy 


301 


Db 


301 


Oy 


361 


Db 


361 


Oy 


421 


Db 


421 



Mill III MUM I II MMIMIilll MM III 111 IMIIMtl MINIUM I 



Mill MUM MM II I III Ml IMIMMI III I IMIIIMI IIMM I II MM I 



IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIItlllllllllllllillllllll 



1 1 1 1 1 1 1 1 1 • i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J I i I F 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 L 1 1 



MMMMMIMMIMIMIMMMMMMMMMMMMIMMMMIMM 



IMIIMIIMMIIIMMMIIIIIMIIIIIIMIIIIIMMMIIMMIIIMI 



linn iiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiitiiiiiii 



illinium iimiimmiiiiiiiiiimiiiiiiiiimiiiii 



Oy 481 GPHLYYVD PVNKVLKGB I PWSQSLRP EAKNFKTF PVHT PNRTYYLMD PSGNAH KWCRKI Q 540 

1 1 1 1 1 1 1 1 1 1 r r 1 1 1 1 1 1 1 1 j 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 481 GPHLYYVD PVNKVLKGB I PWS0ELRPEAKN7KTFFVHTPNRTYYLMDPSGNAH KWCRKI Q 540 

Oy 54 1 EVWRQR YQSH PDAAVQ 556 

1 1 1 M 1 1 M 1 1 1 1 1 1 1 

Db 541 EVWRQRYQSHPDAAVQ 556 



RESULT 10 
ABU04716 

ID ABU04716 standard; protein; 556 AA. 
XX 

AC ABU04716; 
XX 

DT 29-JAN-2O03 (first entry) 
XX 

DE Human expressed protein tag (BPT) «13B2. 
XX 

KW Translational profiling; expressed protein tag; BPT; kinaae; phosphatase; 
KW protease; protease inhibitor; transporter; cytoakeletal protein; 
KW receptor; transcription factor; cancer; KHC; 

KW major hietocompatability complex; myeloma; colon cancer; gastric cancer; 

KW adenocarcinoma; sarcoma; melanoma; lymphoma; leukaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO20027BS24-A2. 
XX 



PD 


10 


-OCT 


•2002. 






XX 












PF 


28 


-MAR- 


•2002; 


2002WO 


-US009671. 


XX 












PR 


28 


-MAR- 


-2001; 


2001US 


-02794 95P. 


PR 


21 


MAY- 


-2001; 


2001US 


-0292544P. 


PR 


0B 


-AUG- 


•2001; 


2001US 


-0310B01P. 


PR 


01 


-OCT- 


•2001; 


2001US 


•0326370P. 


PR 


04 


-DEC- 


■2001; 


2001US-0336780P. 


PR 


20 


-FEB-2002; 


2O02US 


-03S8985P. 



XX 

PA (ZYCO- ) ZYCOS INC. 
XX 

PI Chicz RM. Tomlinson AJ, Urban RG; 
XX 

DR WPI ; 2003-040607/03. 
XX 

PT New polypeptides (e.g. kinases, phosphatases, proteases, transporters, 
PT cytoskeletal proteins, receptors or transcription factors), useful for 
PT treating cancer, e.g. colon cancer, gaatric cancer, aarcoma, lymphoma or 
PT leukemia. 
XX 

PS Example 2; SEQ ID NO 1382; 134pp; English. 
XX 

CC The invention describee a purified polypeptide, which comprises a 

CC fragment of a kinase, phosphatase, protease, protease inhibitor, 

CC transporter, cytoskeletal protein, receptor or transcription factor. The 



CC polypeptide is uaeful as an immunogenic composition for eliciting in a 

CC mammal an immunogenic response directed against any of the purified 

CC polypeptide. The purified polypeptide, or the antibody that binda to this 

CC polypeptide, is useful for treating cancer. The polypeptide is also 

CC useful for identifying compounds that binds to a naturally processed 

CC claes I or class II MHC -binding polypeptide. The polypeptides and 

CC polynucleotides are particularly useful for treating or preventing 

CC myeloma, colon cancer, gastric cancer, adenocarcinoma, sarcoma, melanoma, 

CC lymphoma or leukaemia. These are also uaeful for screening agents for 

CC treating the above mentioned diseases. This sequence repreaenta an 

CC expreaaed protein tag (BPT) isolated from human tissue for translational 

CC profiling. Note: This sequence does not appear in the printed 

CC apecif ication but waa obtained in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences 

XX 

SQ Sequence S56 AA; 



Query Match 100.0%; Score 2941; DB 6; Length 556; 

Beat Local Similarity 100.0%; Pred. No. 2.3e-249; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Gapa 0; 



oy 


1 




60 


Db 


1 


MARTTSQLYDAVPIQSSVVLCSCPSPSMVRTCTESSTPPOI PGGSRQOPAMDGTAAEPRP 


60 


oy 


61 


GAGS LQHAO PP PQPR KKR P ED FKFGKI LGEGSPSTWLARELATSREYA I KI LB KRH 1 1 K 


120 






iiiimmiiimiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiii 




Db 


61 


GAGSLQHAQPPPQPRKXRPED FKFGKI LGEGSPSTWLARELATSREYA I KI LB KRH 1 1 K 


120 


Oy 


121 


ENKVFYVTRBRDVXSRLDHPFFVKLYFTFQDDEIGVYFGL^ 


180 






iiiiiiiiiiiiiimiiiiimiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiii 




Db 


121 


ENKVPYVTRERDVMSRI^HPFFVKLYPTFQDDEKLYFGLSYAKN^ 


160 


Oy 


181 


CTRFYTAB I VSALEYLHGKG I IHRDLKPB«1LUJEDMHIQIT0FGTAKVLSPBSKQARAN 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


CTRFYTAB I VSALEYLHGKG I I HRDLKPBNI LLNEDKH IQI TDFOTAXVLSPBSKQARAN 


240 


Oy 


241 


SFVGTAQYVSPELLTSKSACK5SDLMALGCI I YQLVAGLPPPRAONBYLI FQKI IKLEYD 


300 






Mill Ml 1 IIIMI IIIIIIIIIMMIMIM MIMMIMM MM MMIIMI 1 




Db 


241 


S FVGTAQYVS PELLTB KSACKS SD LKALGC 1 1 YQLVAGLPPFRAONBYLI FQKI IKLEYD 


300 


Oy 


301 


PPEKFFPKARDLVEKLLVL0ATKRLGCBEMEGYGPLKAHPFFESV7VEN 


360 






inn mi imii ii mill iniii mi Mini mi mi mi iii hum 




Db 


301 


PPSKFFPKARDLVBlOAVLDATiaUiGCSBMEtJYaPLKAHPPFBSV™ 


360 


Oy 


361 


YLPAMS ED D EDCYGNYDNLLSQ POCKQVSSSS SSHS LSASDT3LPQRSGSN I EQY I HOLD 


420 






miiimiiiiimiiiiiiiMiiiiimiiiiiiiiiimiiiniiiiiiii 




Db 
Oy 


361 
421 


YLPAMS EDDEDCYGNYDNLLSQFCOfQVSSSSSSHSLSASDTGL PQRSGSN I EQY I KDLD 


420 






mimmiwm^ 


480 


Db 
Oy 


421 
481 


SNS FELDLQPSEDBKRLLLB KQAGGNPWKQFVENNL I LKMG PVD KRKGLFARRRQLLLTB 


480 








540 


Db 


481 GPHLYYVDPWnCVLKGEI PWSQKIJIPEAXNPKTPPVHT 


540 



541 EVWRQRYQSHPDAAVQ SS6 

llllllllllllllll 
541 BVWRQR YQSH PDAAVQ 556 



ABU04711 standard; protein; 556 AA. 
ABU04711; 

29-JAN-2003 (tint entry) 

Hunan expressed protein tag (BPT) 81377, 

Translational profiling; expressed protoin tag; SPT; kinase; phosphatase; 
protease; protease inhibitor; transporter; cytoskeletal protein; 
receptor; transcription factor; cancer; MHC; 

major histocorapatability complex; myeloma; colon cancer; gastric cancer; 
adenocarcinoma; sarcoma; melanoma; lymphoma; leukaemia. 

Homo sapiens. 

WO200278524-A2. 



PD 


10-OCT-2002. 






XX 








PF 


2S-MAR-2002; 


2002WO 


-US009671. 


XX 








PR 


2B-HAR-2001; 


2001US 


-02794 95P. 


PR 


21-MAY-3001; 


2001US-0292544P. 


PR 


OB-AUG-2001; 


2001US 


0310801P. 


PR 


Ol-OCT-2001; 


2001US 


•0326370P. 


PR 


04-DSC-2001; 


2001US 


0336780P. 


PR 


2O-FSB-20O2; 


20O2US-0358985P. 



XX 
PA 



(ZYCO-) ZYCOS INC. 

Chi« RM. Tomlinson AJ, Urban RO; 
WPI; 2003-040607/03. 

New polypeptides (e.g. kinases, phosphatases, proteases, transporters, 
cytoskeletal proteins, receptors or transcription factors) , useful for 
treating cancer, e.g. colon cancer, gastric cancer, sarcoma, lymphoma or 
leukemia. 

Example 2; SEQ ID NO 1377; 134pp; English. 

The invention describes a purified polypeptide, which comprises a 
fragment of a kinase, phosphatase, protease, protease inhibitor, 
transporter, cytoskeletal protein, receptor or transcription factor. The 
polypeptide is useful aa an immunogenic competition for eliciting in a 
mammal an immunogenic response directed against any of the purified 
polypeptide. The purified polypeptide, or the antibody that binds to this 
polypeptide, is useful for treating cancer. The polypeptide is also 
useful for identifying compounds that binds to a naturally processed 



class I or class II KHC-binding polypeptide. The polypeptides and 
polynucleotides are particularly useful for creating or preventing 
myeloma, colon cancer, gastric cancer, adenocarcinoma, sarcoma, melanoma, 
lymphoma or leukaemia. These are also useful for screening agents for 
treating the above mentioned diseases. This sequence represents an 
expressed protoin tag (BPT) isolsted from human tissue for translational 
profiling. Note: This sequence does not appear in the printed 
specification but was obtained in electronic format directly from WIPO at 
f tp . wipo . i nt /pub/ pub 1 i shed _pct_sequencee 



556 AA; 

Query Match 100.0%; 
Best Local Similarity 100.0%; 
Hatches 556; Conservative I 



Score 2941; DB 6; 
Pred. No. 2.3e-249; 
; Mismatches 0; 



oy 

Db 

oy 



oy 

Db 
Oy 
Db 
Oy 
Db 
Oy 
Db 
Oy 
Db 
Oy 
Db 
oy 

Db 



1 MARTTSQLYDAVP I QSSWLCSCPSPSMVRTQTESSTPPG I PGGSRQGPAKDGTAABPRP 60 

II! I II III lllllllll III llllllllllllllll II Iflllllll III I MIIMII 

1 MARTTSQLYDAVP I OS SWLCS CPS PSMVRTQTBS ST PPG I POGSRQG PAXDGTAAE PRP 60 

61 GAGSLOilAQPPPQPRKKRPEDFKFCXILGEGSFSTVVLARELATSREYAIKJLEKRHI IK 120 

tllllllllllllltllllllllllllllllllllllllllltlllllllllllllllll 
61 GAGSLQHAQPPPOPRKKRPEDPKFGXI LGEGSPSTWLARBLATSRBYA I KI LBKRH 1 1 K 120 

121 ENKVPYVTRBRDVMSR1JDHPFFVKLYFTFQDDHKLYFGLSYAJ<1JGBIXKYIRKIGSFDET 160 

llllllllllllllllllllllllllllilllltllltilllllllllllllllllltll 
121 ENKVPYVTRERDVMSRI^H PFPVKL YFTFC£DE KLYFGLSYAKNGELLKY I RKIGSFDET 180 

181 CTRFYTAEIVSALEYLHGKDI IHRDLKPENIIJ^NEDMHIQITDFOTAKVLSPSSKOARAN 240 

llltlllllltlilllllllllllllllllllllllllllllllilllltllllllllll 
181 CTRPYTAEIVSALBYLHGKOI IKRDLKPENILLNEDMHIQITDFGTAXVLSPESKQARAN 340 

241 SFVGTAQYVSPELLTEKSACKSSDLWALGCI IYQLVAGLPPFRAGNBYLIFQKI I KLEYD 300 

Ml I II Ml I III lllll I II MM II I MINI IIIIIIIIIMI II MUM III III 

241 S FVOTAQYVS PE LLTE KSACKS SDLHALGC I IYQLVAGLPPFRAGNEYLIFQKI I KLEYD 300 
301 FPEKFFPKARDLVEKlXVimTKRU3CEEMEGYGPLKAHPFFESVTWENI^Q0TPPKLTA 360 

I III II I Ml M Ml MM Ml MUM lllllllll IN 1 1 1 1 1 1 1 1 1 1 1 MM Mill 

301 FPE J07FP KARDLVE KLLVLQATKRLGCBBMEGYG PLKAH P FFBSVTWENLHQQTP P KLTA 360 
361 YLPAMSEDD BDCYGNYDNLLSQFGCMOV SS S SS SHSLSASDTGLPQRSGSN I EQY I HDLD 420 

i r r 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 j j i ti 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 

361 YLPAMS EDD EDCYGNYDNLLSQFGCMQVSSSSSSHSLSASDTGLPQRSGSNI EQY I HDLD 420 
421 SNS F E LD LQ PS ED E KRLLLE KQAGGNPWH Q FVENNL I L KMG PVD KR KG LF ARRRQ LLL TE 480 
421 SNSFBLDLQFS ED E KRLLLE KC^xJlIVH QFVENNL I LKMG PVD KKKELFARRRQLLLTB 48" 
481 GPHLYYVDPVNKVLKGBI PWSQBLRPBAKNFKTFPVHTPNRTYYXjMDPSGNAHKWCRKJQ 540 

1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 r 1 1 

481 GPHLYYVDPVNKVLKGBI PWSQBLRPSAntFKTFFVHTPNRTYYLMDPSGNAHKWCRKIQ 540 

541 BVWRQR YQSH PDAAVQ 556 

llllllllllllllll 
541 BVWRQR YQSH PDAAVQ 556 



RESULT 12 
ABU04706 

ID ABU04706 standard; protein; 5S6 AA. 
XX 

AC ABU04706; 
XX 

DT 29-JAN-2003 (first entry) 
XX 

DE Human expressed protein tag (BPT) #1372. 
XX 

KM Translational profiling; expressed protein tag; BPT; kinase; phosphatase; 

KW protease; protease inhibitor; transporter; cytoskeletal protein; 

KW receptor; transcription factor; cancer; MHC; 

KW major histocompatability complex; myeloma; colon cancer; gastric cancer; 



leukaemia. 



KW adenocarcinoma; aan 
XX 

OS Homo sapiens. 
XX 

PN WO200278524-A2. 
XX 

PD 10-OCT-2002. 
XX 

PP 28-MAR-2002; 2002WO-US009671 . 
XX 

PR 28-MAR-2001; 2001US-027949SP. 

PR 21-MAY-2001; 2001US-0292544P . 

PR 08-AUG-2001; 200 1US-03 10S01P . 

PR 01-OCT-2001; 2001US-0326370P. 

PR 04-DBC-2001; 2001US-0336780P. 

PR 20-FEB-2002; 2002US-035898SP. 
XX 

PA (ZYCO-) ZYCOS INC. 
XX 

PI Chicz RM, Tomlinson AJ. Urban RG; 
XX 

DR WPI; 2003-040607/03. 
XX 

PT New polypeptides (e.g. kinases, phosphatases, proteases, transporters, 

PT cytoskeletal proteins, receptors or transcription factors) . useful for 

PT treating cancer, e.g. colon cancer, gastric cancer, sarcoma, lymphoma or 

PT leukemia. 
XX 

PS Example 2; SBQ ID NO 1372; 134pp; English. 
XX 

CC The invention describes a purified polypeptide, which comprises a 

CC fragment of a kinase, phosphatase, protease, protease inhibitor, 

CC transporter, cytoskeletal protein, receptor or transcription factor. The 

CC polypeptide is useful as an immunogenic composition for eliciting in a 

CC mammal an immunogenic response directed against sny of the purified 

CC polypeptide. The purified polypeptide, or the antibody that binds to this 

CC polypeptide, is useful for treating cancer. The polypeptide is also 

CC useful for identifying compounds that binds to s naturally processed 

CC class I or class II KHC-binding polypeptide. The polypeptides and 

CC polynucleotides sre particularly useful for treating or preventing 

CC myeloma, colon cancer, gastric cancer, adenocarcinoma, sarcoma, melanoma. 

CC lymphoma or leukaemia. These are also useful for screening agents for 

CC treating the above mentioned diseases. This sequence represents an 



CC expressed protein tag (BPT) isolated from human tissue for tranalational 

CC profiling. Note: This sequence does not appear in the printed 

CC specification but was obtained in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/publiahed _pct_sequences 

XX 

SO Sequence 556 AA; 

Query Match 100.0%; Score 2941; DB 6; Length 556; 

Beet Local Similarity 100.0%; Pred. No. 2.3e-249; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Oy 


l 


Db 


l 


Oy 


61 


Db 


61 


Oy 


121 


Db 


121 


Oy 


181 


Db 


181 


Oy 


241 


Db 


241 


Oy 


301 


Db 


301 


oy 


361 


Db 


361 


Oy 


421 


Db 


421 


oy 


481 


Db 


481 


Oy 


541 


Db 


541 



I III III III III III III II lllllllll MM II III III lllllllll III Mil II 



IIIIMIMIMIIMMilllllMIMIIMIIIIMIIMM 



llllllllllllll 

IIK 120 



I M I II I I I I II II I II II I I II MIIMIIMMMIIIIMMMIMI 

EN KVPYVTRERDVMSRLDHPFFVKLYFTFQDDEKXYfGLSYAKNGELLKY I RKIGSFDET 180 



II MM lllll III llllllllllllllll II III llllllllllllllllllllllll I 



I III II I III III Ml II III I III II I II III I II I II I III II I II III I II I II I II 



IIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIII 



I Ml Ml II MM II I II MM II III MM III II I II MM III 111 II I III II I II 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 r f 1 1 1 1 1 1 1 1 1 

1KRLLLBKQAGONPWHQFVBNNLILKMGPVDKR103LFARRRQLLLTB 480 



IIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMII 

— ------ --- ITYYLMDPSGNAHKWCRKIQ 540 



IIIIIIIMIIMM 



RESULT 13 

ABU04714 

ID ABU04714 

XX 

AC ABU04714; 



itsndard; protein; 556 AA. 



DT 29-JAN-2003 (first entry) 
XX 

OB Human expressed protein teg (BPT) 01380. 
XX 

KM Trans lational profiling; expressed protein tag; BPT; kinase; phosphatase; 

KM protease; protease inhibitor; transporter; cytoskeletal protein; 

KM receptor; transcription factor; cancer; MHC; 

KM major histocompetability complex; myeloma; colon cancer; gastric cancer; 

KM adenocarcinoma; sarcoma; melanoma; lymphoma; leukaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200278524-A2. 
XX 

PD 10-OCT-2002. 
XX 

PF 2B-KAR-2002; 2002WO-US009671 . 
XX 

PR 28-MAR-2001; 2001US-0279495P. 

PR 21-MAY-2001; 2001US-0292544P. 

PR OB-AUG-2001; 2Q01U5-0310801P. 

PR 01-OCT-2001; 2001US-0326370P. 

PR 04-DEC-3001; 2001US-0336780P. 

PR 20-FBB-2OO2; 2002US-035898SP. 
XX 

PA (ZYCO-) 2YCOS INC. 
XX 

PI Chicz RM. Tomlinson AJ. Urban RG; 
XX 

OR MPI; 2003-040607/03. 
XX 

PT New polypeptides (e.g. kinases, phosphatases, proteasea, transporters, 

PT cytoskeletal proteins, receptors or transcription factors) . useful for 

PT treating cancer, e.g. colon cancer, gaatric cancer, sarcoma, lymphoma or 

PT leukemia. 



Example 2; SEQ ID NO 1380; 134pp; English. 

The invention describes a purified polypeptide, which comprises a 
fragment of a kinase, phosphatase, protease, protease inhibitor, 
transporter, cytoskeletal protein, receptor or transcription factor. The 
polypeptide is useful as an immunogenic composition for eliciting in a 
mammal an innunogenic response directed against any of the purified 
polypeptide. The purified polypeptide, or the antibody that binds to this 
polypeptide, is useful for treating cancer. The polypeptide is also 
useful for identifying confounds that binds to a naturally processed 
class I or class II MHC-binding polypeptide. The polypeptides and 
polynucleotides are particularly useful for treating or preventing 
myeloma, colon cancer, gastric cancer, adenocarcinoma, sarcoma, melanoma, 
lymphoma or leukaemia. These are also useful for screening agents for 
treating the above mentioned diseases. This sequence represents an 
expressed protein tag (BPT) isolated from human tissue for translational 
profiling. Note: This sequence does not appear in the printed 
specification but was obtained in electronic format directly from WIPO at 
f tp.wipo. int/pub/published_pct_sequences 



60 Sequence 556 AA; 

Query Match 100.0%; 
Best Local Similarity 100.0%; 
Matches 556; Conservative 0 



Score 2941; DB 6; 
Pred. No. 2.3o-249; 
Mismatches 0; 



Oy 


l 


Db 


l 


oy 


61 


Db 


61 


oy 


121 


Db 


121 


oy 


161 


Db 


181 


oy 


241 


Db 


241 


Oy 


301 


Db 


301 


oy 


361 


Db 


361 


Oy 


421 


Db 


421 


Oy 


461 


Db 


461 


Oy 


S41 


Db 


541 



llllllilllllllllllltlllllllllllllllllllllllllllllllllfllllli 



llltlllllllllllllltllillllllilllllllllllllllltlllllllllllMI 



llllilllllllllllltllllllllllltllllllltllllllililllllltllllll 



lllllllllllllllllltllllllllllllllllllllltllllllllllliltlllll 



ii iii iii imi ii mi imiii iii imi him i in Mini niiiii Minn 



II III IN MM II I III llllll Ml MM II Ml I Ml III III IIIMM lllllll 



lllllltlllllllllllllllltlllllllllllllllltlllllllllilllllllll 



r 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 m i f j i r 1 1 1 1 m 1 1 1 j 1 1 1 1 1 r 1 1 1 1 [ 1 1 1 1 1 

SNSFBLDLOFSBDSKRLLLBKOAOaNPHHQFV^NNLILKMGPVOKRKOLFARRRQU^TB 4 60 



1 1 1 1 1 1 1 1 1 r 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



iiiiiiiimiiiii 



RESULT 14 
ABU04707 

ID ABU04707 atandard; protein; 5S6 AA. 



ABU04707; 

29-JAN-2003 (first entry) 

Human expressed protein tag (EPT) #1373. 



Translational profiling; expressed protein tag; EPT; kinase; phosphatase; 
protease; protease inhibitor; transporter; cytoskeletal protein; 
receptor; transcription factor; cancer; MHC; 

major histocompetability complex; myeloma; colon cancer; gastric cancer; 
adenocarcinoma; sarcoma; melanoma; lymphoma; leukaemia. 



XX 




oy 


61 


OS 


Homo sapiens. 






XX 




Db 


61 


PN 


WO200278524-A2. 






XX 




Oy 


121 


PD 


10-OCT-2002. 






XX 
PF 


28-MAR-2002; 2002WO-US009671 . 


Db 


121 


XX 




Oy 


181 


PR 


28-KAR-2001; 2001US-0279495P. 






PR 


21-KAY-2001; 2001US-0292544P. 


Db 


181 


PR 


0S-AUG-2O01; 2001US-0310801P. 






PR 
PR 


01-OCT-2001; 2001US-0326370P. 
04-DEC-2001; 2001US-0336780P. 


Oy 


241 


PR 


20-FEB-2002; 2002US-0358985P. 


Db 


241 


XX 








PA 


(ZYCO-) ZYCOS INC. 


Oy 


301 


XX 






PI 


Chicz RM, Tomlinson AJ, Urban RG; 


Db 


301 


XX 








DR 


WPI; 2003-040607/03. 


oy 


361 


XX 






PT 


New polypeptides (e.g. kinases, phosphatases, proteases, transporters, 


Db 


361 


PT 


cytoskeletal proteins, receptors or transcription factors), useful for 







PT treating cancer, e.g. colon cancer, gaatric cancer, sarcoma, lymphoma or 

PT leukemia. 
XX 

PS Example 2; SEQ ID NO 1373; 134pp; English. 
XX 

CC The invention describes a purified polypeptide, which comprises a 

CC fragment of a kinase, phosphatase, protease, protease inhibitor, 

CC transporter, cytoskeletal protein, receptor or transcription factor. The 

CC polypeptide is uaeful as an immunogenic composition for eliciting in a 

CC ma anal an immunogenic response directed against any of the purified 

CC polypeptide. The purified polypeptide, or the antibody that binds to this 

CC polypeptide, is useful for treating cancer. The polypeptide la also 

CC useful for identifying compounds that binds to a naturally processed 

CC class I or class II MHC-binding polypeptide. The polypeptides and 

CC polynucleotides are particularly useful for treating or preventing 

CC myeloma, colon cancer, gastric cancer, adenocarcinoma, sarcoma, melanoma, 

CC lymphoma or leukaemia. These are also useful for screening agents for 

CC treating the above mentioned diseases. This sequence represents an 

CC expressed protein tag (BPT) isolated from human tissue for translational 

CC profiling. Note: This sequence does not appear in the printed 

CC specif ication but was obtained in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct sequences 

XX 

SO Sequence 556 AA; 

Query Match 100.0%; Score 2941; DB 6; Length 556; 
Best Local Similarity 100.0%; Pred. No. 2.3e-249; 

Ketches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Oy 
Db 
oy 

Db 

Oy 



1 MARTTSQLYDAWIQSSVVlX^CPSPSMVTlTOTBSSTPPaiPGGSROOPAMDGTAABPRP 60 

II III Ml MUM MM III III III III I M I IMIII MIMI III I IMIII MM 

1 MARTTSQL YDAVP I QSSWLCS CPS PSMVRTQTESSTPPO I POQSRQO PAMDGTAAB P RP 60 
GAG5LQHAQPPPQPRKKRPEDFKFGKILGEGSFSTVVLARELATSREYAI KILBKRHI I K 120 

llllllllllltlllMlllllllllllllllllllllllllllllllllltlllllilt 

GAGSLQKAO PP PQPRKKR PED FKFO KI LGEGS FSTWLARE LATS REYA I KI LE KRH I IK 120 
ENXVFYVTRERDVMSRLDH P PFVKL YFTFQDDBKLY FGLSYAKNGBLLKY I RKI GS FDET 180 

IIIIIIMIIIIIIIIIIIIIIitlllMIIIIIIIIHIMIIIIIIIIimilllil 

ENKVP YVTRBRDVMSRLDH P P FVKL YFTFQDDB KLYFOLSYAKNGBLLKY I RKI GS FDET 1 8 0 
CTRFYTAEIVSALEYLHGKGI IHRDLKPENI LLJJEDMH IQITDFGTAKVLSPBSKQARAN 240 

MMIMMIMMMMMMMMMMMMMIMIMMMIMMMMIMM 

CTRFYTAEIVSALEYLHGKGI IHRDLKPENILLNEDMHIQITDFGTAKVX>SPBSKQARAN 240 
SFVGTAQYVSPBLLTEKSACKSSDLHALGCI IYQLVAGLFPFRAGNBYLI PQKI I KLBYD 300 

1 1 1 1 1 1 1 1 L 1 1 1 f M 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 

S FVGTAQYVSPBLLTEKSACKSSDLWALGCI I YQLVAGLPPFRAGNBYL I FQKI I KLEYD 300 
F PE KFFPKARDLVE KLLVLDATKRLGCEEMEGYGPLKAH P PPESVTWENLHQGTPPKLTA 360 

MMMMMMMMMMMMMMMMMMMIMMIMMMMMMMM 

F PB KPFPKARD LVEKLLVLDAT KRLGCBEMEGYG PLKAH P FFESVTWENLHQQTPP KLTA 360 
YU'AMSBDDBDCYGNYDNLLSQFGCMQVSSSS SSHSLSASDTGLPQRSGSN I EQY IHDLD 420 

MMIMMIMMMMMMMMMMMMMIMMIMMMMMIMIMM 

YLPAMSSDDEDCYGNYDNLLSQFGCMQVSSSSSSHSLSASDTGLPQRSGSN I EQY IHDLD 420 
SNS FELCLQFSSDBKRLLLE KQAGGNPWHQFVENNL I LKMG PVDKRKGLFARRRQLLLTB 480 

MIIMIIIIIMMIMIIIIIIIIIIMIIMIMIIIMMIIMIIIIMIMIII 

SNS FELDLOFSBDE KRLLLE KQAGGNPWHQ FVENNL I LKMG PVDKRKGLFARRRQLLLTB 480 



421 
421 
481 
481 



GPHLYYVDPVNKVLKGS I PWSQSLRPEA}OJFKTFFVHTPNRTYYLMDPSGKAHKWCRKIQ 540 

1 1 1 1 1 1 ! 1 1 1 r 1 1 1 1 1 IMIMMMMMMMMMMMMMMIMM 

GPHLYYVD PVNKVLKGE I PWSQEIJtPEAXNFKTPPVHTPNRTYYLMDPSGNAHKWCRKI Q 540 



541 BVWRQR YQSH PDAAVQ 556 

II MIMMIII IMI 

541 BVWRQR YQSH PDAAVQ 556 



RESULT IS 
ABU04717 

ID ABU04717 standard; 
XX 

AC ABU04717; 



protein; 556 AA. 



DT 29-JAN-2003 (first entry) 
XX 

DB Human expressed protein tag (BPT) A13B3. 
XX 

KM Translational profiling; expressed protein tag; BPT; kinase; phosphatase; 

KW protease; protease Inhibitor; transporter; cytoskeletal protein; 

KM receptor; transcription factor; cancer; MHC; 

KM major hiatocoopatability complex; myeloma; colon cancer; gastric cancer; 

KW adenocarcinoma; sarcoma; melanoma; " 







Oy 


61 


GAGSLQHAQPPPQPRKKRFEDPKFGKI LGEGSPSTWLARBLATSRBYAI KI LEKRH 1 1 K 


120 


OS 


Homo sapiens . 




1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

IIIMllllllllMlllfiilllirilllllllllllllMIMllllllttMMIII 




XX 




Db 


61 


GAGS LQHAQPP PQPRKXRP ED PKFG KI LGEGSFSTWLARBLATSRBYAI KI LEKRH 1 1 K 


120 


PN 


WO200276S24-A2. 










XX 




Oy 


121 


ENKVPYVTRERDVMSRLDH FFFVKLYFTFQDDE KL Y FGLSYAKNG ELLKY I RKI GSFDET 


1B0 


PD 


lO-OCT-2002 . 






ill ii i inn i ill 1 1 him ii nun i mi inn mini i milium ii 

1 1 1 1 1 1 ! 1 ! 1 1 1 1 1 4 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I 1 




XX 




Db 


121 


ENKVPYVTRBRDVKSRLDHPPI'Via^YPTFOTDB^ 


180 


pp 


2B-MAR-2002; 2002WO-US00 967 1 . 










XX 




oy 


181 


CTRPYTAB I VSALBYLH0 HQ 1 1 HRD LKPENI LLNEDMH 1 0 1 TDFQTAXVLSPBSKQARAN 


240 


PR 


28-HAR-2001; 200 1US - 027 94 9SP . 






1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

MMIMlMllIllllllltllllMllfllllMIIJIIIIIMIJMIMIIMIIII 




PR 


21-KAY-3001; 2001US-0293544P. 


Db 


181 


CTRPYTAB I VSALBYLKG KG I IKRDLKPENILUIEDtQIIOITDFGTAKVLSPBSKQARAN 


240 


PR 


08-AUO-2001; 2001US -03 1080 1 P . 










PR 


Ol-OCT-2001; 20O1US-0326370P. 


Oy 


241 


SFVGTAQWSPBLLTBKSACKSSDLHALGCX I YQLVAGLPP PRAGNBYL I FQKI I KLBYD 


300 


PR 


04-OSC-2001; 3001US-0336780P. 






1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 Ml III 1 III 1 1 1 1 1 1 1 1 1 

1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 \ \ \ 1 1 r I 1 1 1 1 \ 1 1 1 1 | \ \ \ \ \ \ | | \ \ \ \ \ \ | \ 




PR 


20-PB8-2002; 2002US-03S898SP. 


Db 


241 


SFVaTAQYVSPBLLTSKSACXSSDLWALGCI I YQLVAGLPP PRAGNBYL I FQKI I KLBYD 


300 


XX 
PA 


(ZYCO-) ZYCOS INC. 


Oy 


301 


FPBKFFP KARDLVE KLLVLDATKRLGCEEMEGYG PLKAHPF FESVTVHNLHQQT P P KLTA 


360 


XX 






im III 1 mill 1 III 1 II 1 II! 1 II llll 1 II 1 MINI 111 lllllllll II 1 III 




PI 


Chicz RM, Toralinson AJ, Urban RG; 


Db 


301 


FPBKFFP KARDLVE KLL VIXATKRLGCBEMEGYGPLKAHP F FB5VTWENLHQ0TP P KLTA 


360 


XX 
DR 


WPI; 2003-040607/03. 


Oy 


361 


YLPAMS EDO EDCYGNYDNLLSQFGCKQVS SS S S SH S LSASDTG LPQRSGSU IEQYIHDLD 


420 


XX 








| | | | | | | | | | | || | | | { | | | | | | | | | | | | | | | | | | | | | | | || | | | | | | | | || | || || | | | 




PT 


Hew polypeptides (e.g. kinases, phosphatases, proteases, transporters, 


Db 


361 


YLPAMSEDDEDCYGNYDNLLSQFGCMQVSS S SSSHSLSASDTGL PQRSGSN IEQYIHDLD 


420 


PT 


cytoskoletal proteins, receptors or transcription factors), useful for 










PT 


treating cancer, e.g. colon cancer , gastric cancer, sarcoma, lymphoma or 


Oy 


421 


SNS FELDLQFSEDEKRLLLE KQAGGNPHHQFVENNL I LKMG PVD KRKGLFARRRQLLLTB 


480 


PT 


1 eukemia . 






1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
MIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIII 




XX 




Db 


421 


SNSFEIJDLQPSEDEK3UXI^K0AGGNPVHQFVENNLIL»« 


480 


PS 


Example 2; SEQ ID NO 1383; 134pp; English. 










XX 




Oy 


481 




540 


cc 


The invention describes a purified polypeptide, which comprises a 






GPHLYYyDPVNKVLKGBIW 




cc 


fragment of a kinase, phosphatase, protease, protease inhibitor, 


Db 


481 


GPHLYYVDPVNKVLKGB I PHSQE LRPEAKNFKT PFVHTPNRTYYLMD PSGNAHKHCRKI 0 


540 


cc 


transporter, cytoskeletal protein, receptor or transcription factor. The 










cc 


polypeptide is useful as an immunogenic composition for eliciting in a 


Oy 


541 


EVKRQRYOSHPDAAVQ 556 




cc 


maomal an immunogenic response directed against any of the purified 






llllllllllllllll 




cc 


polypeptide. The purified polypeptide, or the antibody that binds to this 


Db 


541 


EVWRQRYQSH PDAAVQ S56 





CC polypeptide, is useful for treating cancer. The polypeptide is also 

CC useful for identifying compounds that binds to a naturally processed 

CC class I or class II KHC-binding polypeptide. The polypeptides and Search completed: June 27, 2006, 01:31:14 

CC polynucleotides are particularly useful for treating or preventing Job time : 199 sees 

CC myeloma, colon cancer, gastric cancer, adenocarcinoma, sarcoma, melanoma, 

CC lymphoma or leukaemia. These are also useful for screening agents for 

CC treating the above mentioned diseases. This sequence represents an 

CC expressed protein tag (EPT) isolated from human tissue for translational 

CC profiling. Note: This sequence does not appear in the printed 

CC specification but was obtained in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences 

XX 

SQ Sequence 556 AA; 

Query Hatch 100.0%; Score 2941; DB 6; Length 556; 

Beat Local Similarity 100.01; Pred. No. 2.3e-249; 

Matches 556; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 MARTTSQLYDAVP I QSSWLCSCPS PSMVRTQTBS ST P PG I PGGSROG PAMDGTAAE PRP 60 

lllllllllllllllllillllllllllllllllllllllllllllllllllilllllll 
Db 1 MARTTSQLYDAVP I QSSWLCSCP S PSMVRTQTBS ST P PG I PGGSRQGPAMDGTAAE PRP 60 



OH protein • 
Run en: 



GenCore version 5.1.9 
Copyright (c) 1993 - 3006 Bioccoleration Ltd. 



protein search, using sw modal 

June 27. 2006, 01:31:32 ; Search time 42 Seconds 
(without alignments) 
1273.727 Million coll updatas/sec 



Titla: US-10-689-576-1 

Parfact score: 2941 

Sequence: 1 MARTTSQL YDAVP I QSSWL RK1 QBVWRQR YQSH PDAAVQ 556 

Scoring table: 



BLCSUM62 
Gapop 10.0 , 



Gapext 0.5 
96216763 residues 



Totsl number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum OB seq length: 2000000000 

Post -processing : Minimum Match 0% 

Maximum Match 100% 
Liating first 45 



Database : 



PIR_80:* 
It pirl:* 
2: pir2:« 
3: pir3:« 
4: pir4:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 















138121 




15 


SSI 5 


19 




351 




OKHYCB 


protein kinase (EC 






0 












_ 


5SB5 


19 




351 


l 


OKRTCB 


protein kinase {EC 




558 S 


19 




425 


2 


S41099 


protein kinase (BC 


19 


557*5 


19 








0KB0B1 




20 


557 . 5 


19 


. 0 


371 


2 


T16391 


hypothetical prote 










397 




0KBOB2 


protein kinase (BC 


32 


557 


18. 


, 9 


334 


2 


A60543 


protein kinase (BC 








' ? 






OKBYC3 


protein kinase (EC 


24 




18 




680 






protein kinase YPK 


25 


554 


18 


' - 


380 




0KBYC2 




26 


553 


18 




502 


3 


P31751 


Drot'i" 1 ki""* 


27 


550 5 


18 


' 7 


351 




OKHUCB 


protein kinase *(BC 


28 


540 . 5 


18 


, 7 


312 


2 


T32446 


hypothetical prote 








, 7 


312 


2 


A89460 


protein H42K22.1 ( 


30 


548 


18 


. 6 


375 


2 


T21212 


hypothetical prote 






18 








S22258 


probable protein k 


32 


546.5 


IB , 


. 6 


677 


2 


JS0178 


protein kinase YKR 










351 




0KB02C 


protein kinase (BC 


34 


545.5 


IB , 


s 


1092 


2 


H96509 


protein F27F5.23 [ 


35 


544 . 5 




, 5 


360 


1 


OKHUOG 


protein kinase (BC 






18 












37 


542 


18. 


. 4 


351 


1 


0KHU3C 


protein kinase (BC 


36 


541 


18, 


.4 


351 


1 


OKHYCA 


protein kinase (EC 


39 


541 


IB. 


4 


375 


1 


OKKKC3 


protein kinase (BC 


40 


540.5 


IB. 


4 


569 


2 


T50414 


probable prollfera 


41 


539.5 


IB. 


3 


479 


2 


A38S7B 


protein kinase 2 ( 


42 


539.5 


IB. 


3 


S41 


1 


T43232 


protein kinase (EC 


43 


538 


18. 


3 


351 


1 


OKMSCA 


protein kinase (BC 


44 


538 


18. 


3 


480 


2 


S56639 


ribosomal protein 


45 


537 


18. 


3 


3S1 


1 


OKRT2C 


protein kinase (EC 



ALIGNMENTS 



RESULT 1 
T43402 

probable protein kinase (BC 2.7.1. -> 
C; Species: Schizosaccharoraycee pombe 



fission yeast (Schizosaccharomyces j 



Result 




Query 










C;Date: 21-Jan-2000 ftsequonce revision 21-Jan-2000 fttext change 09-Jul-2004 


No. 


Score 


Match Length DB 


ID 


Description 


C; Accession: T43402; T41425 
















R;Niederberger, C. 


1 


873.5 


29.7 


592 


2 


T43403 


probable protein k 


submitted to the EMBL Data Library, July 1998 


2 


832 


28.3 


488 


2 


T48447 


3-phosphoinositide 


A; Reference number: Z22486 


3 


748.5 


25.5 


1081 


2 


S51899 


probable protein k 


A; Accession: T43402 


4 


733 


24.9 


550 


2 


T40486 


phosphoi nos i t ids - d 


A; Status: preliminary; translated from GB/EMBL/DDBJ 


5 


712 


24.2 


766 


2 


S69657 


hypothetical prote 


A; Molecule type: DMA 


6 


648.5 


22.1 


898 


2 


S69634 


hypothetical prote 


A;Residues: 1-592 <NIE> 


7 


606.5 


20.6 


893 


2 


S63378 


hypothetical prote 


A; Cross -references: UNIPROT:Q12701; UNI PARC : UPI 000006 9B00 ; EMBL:X992B0; 


8 


568.5 


19.3 


648 


1 


JQ1150 


protein kinase (EC 


PIDN:CAA67672.1 


9 


566 


19.2 


471 


2 


S68463 


protein kinase ATP 


A; Experimental source: strain h90 


10 


565.5 


19.3 


320 


2 


T33662 


hypothetical prote 


R;Wood, V.; Rajandream, M.A.; Barrell, B.G.; Murphy, L.; Harris, D. 


11 


565.5 


19.2 


465 


2 


S6B462 


protein kinase ATP 


submitted to the EMBL Data Library, September 1998 


12 


562.5 


19.1 


431 


2 


A48094 


serum and glucocor 


A; Reference number: Z219S4 


13 


561.5 


19.1 


33B 


2 


S000B5 


protein kinase (EC 


A; Accession: T41425 



A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-592 <WOO> 

A; Cross -references: UNI PARC : UPI0000069BOO ; EMBL : AL031798 ; PIDN : CAA21194 . 1; 
GSPDB:GN00068; SPDB : SPCC576 . 15c 

A; Experimental source: strain 972h-; eosmid c576 
C; Genetics: 

A; Gene: KSG1; SPCC576.15c 

A; Map position: 3 

A;Introns: 191/3 

C; Keywords : phosphotransferase 

Query Match 29.7%; Score 873.5; DB 2; Length 592; 

Best Local Similarity 37.3%; Pred. No. 1.3e-29; 

Matches 212; Conservative 89; Mismatches 166; Indels 101; Gsps 17; 

Oy 14 I OS SWLCS CPS PSMVRTQTBSSTPPG I PGGSRQG P AMDGTAAE P RPGAGS LQHAQ P PPQ 73 

I I I « h I : i I : I i | I >|> 
Db 49 IPQSNALNTTPNBSTSOIDSSPKIPSAVP HISTPNPSSGA ST 90 



Oy 74 PRKKRPEDFKFGKILGEGSFSTWLARELATSREYAIKI LEKRH 1 1 KENKVPYVTRKRDV 133 

I II llllhlllllhllh I I :| IMIIhhlllllll I II I:: 
Ob 91 PNI KRVSDFKFGEILGEGSYSTVLTATENSTKREYAI KVLJ3KRHI IKEKKEKYVNIEKEA 150 



Oy 134 MSRLD-HPFFVKLYPTFQDDEKLYFG1^YAJ<1«ELLKYIRKIGSFDETCTRFYTABIVSA 192 

» I II 1*11**1111 III II 1 = 11111 II hi hi I ::| I II l 
Db 151 LC I LS KH PGF I KL FYT FODAHNLYFVLSLARNGELLD Y I NKLG R FNE I CAQYYAAL I YDS 210 

Oy 193 LEYLHGKGI IHRDLKPBNILLNEDMHIOITDFGTAXVLS PESKQARAN 240 

::hlhhlllllllllllh::| Mlllhlhh * II « 

Ob 211 I OYMHGRGV I HRDLKPENI LLDDNMRTKITDFGSAKI LKSSHGSHSEDTHHAD KPQAHSR 270 

Oy 241 SFVOTAOYVSPBLLTSKSACKSSDLHALGCI IYQLVAGLPPFRAGNEYLI FQKI I KLBYD 300 

HlllhllllhhH I :|hll llhH^II III llllll II h I h 
Db 271 SFVGTAR YVS PEVLSD KI AGTASD I WAFGC I LFQMLAG KP P FVAGNBYLT FQS I LHLSYE 330 

Oy 301 FPB KFF P KARDLVE KLLVLDATKRLGCEEMEGYG PLKAH P FFESVTWBN - LKQOTP P KLT 359 

I I I I r r 1 1 1 | 1 1 || :| : |||| : ::| | : ||:| 

Db 331 IPPDISDVASOLIKKLLVLOPKDRLTVDB IHQHPPPNGIKFDNTLWELPPPRLK 384 

Oy 360 AYLPAMSBDDEDCYGNYDNLLSOFGCMQVSSSSSSHS LSASDTGLPQ 406 

• »M « I I =111 I 

Ob 305 PF GHTSVLS LSV7NASNKHBNGDLTSPLGVP5MVSASTNAAPS 427 

Oy 407 RSGSNIE0YIHDLDSNSFSLOLQFS EDBK RLLLEKQAG- -GNPWHQFVENN 455 

h II h ::| III I : =1 II :| . 

Db 428 PVUTFNRGTLLPCOSNLBEENKEWSSILODDEiUSKIGTLNVYS 406 

Oy 456 L I LKMG PVDKRKGLFARR - - RQLLLTB -GPHLYYVD - - PVNKVLKGE IP WSQELRPB 507 

II =1 I :U I =1 I I III I 

Db 487 LFRKRKPRTP I LTNFGRYLCVASDGEGRKTVXEB IPX KSVGMRCRMV 533 

Qy 508 AKNFKTFFVHTPNRTYYLMD PSGNAH KM 535 

I * I II Ihl I I 

Db 534 KNNEHGWWETPTKSHSFEDPNGPASAH 561 



RESULT 2 
T48447 

3-phosphoinositide-dependent protein kinase- 1 PDK1 - Arabidopsis thaliana 

N; Alternate names: protein T32M21.110 

C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 20-Apr-2000 #sequence_revision 20-Apr-2000 #text_change 09- Jul -2004 
C;Acceaaion: T48447 

R;Bevan, M.; Terryn, N. j Ardiles, W. ; Buysshaert, C. ; Dasseville, R. ; De Clerck, 
R.; De Keyser, A.; Neyt, P.; Rouze, P.; Van Den Oaele, H. ; Villaroel, R.; 
Gielen, J.; Van Montagu, M.; Bancroft, I.; Mewes, H.W. ; Rudd. S. ; Lemcke, K. ; 
Mayer, K.F.X. 

submitted to the Protein Sequence Database, March 2000 

A; Reference number: Z24487 

A; Accession: T48447 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-488 <BEV> 

A; Cross-references: UNI PROT : Q9LZ74 ; UNI PARC : UPIO00OO9CE94 ; EMBL :AL162875 

A; Experimental source: cultivar Columbia; BAC clone T32M21 

C; Gene tics: 

A; Map position: 5 

A;Introns: 54/3; 115/1; 132/3; 166/3; 203/1; 229/2; 284/3; 329/3; 387/2; 461/3 
A; Note: T32M2 1.110 

Query Match 28.3%; Score 832; DB 2; Length 488; 

Best Local Similarity 40.2%; Pred. No. 5.6e-28; 

Matches 192; Conservative 69; Mismatches 165; Indels 52; Gaps 11; 

Oy 81 DFKFGKILGEGSFSTWLARBLATSREYAI KI LEKRHI IKENKVPYVTRBRDVMSRLDHP 140 

llllll I Ihl II In I Ihlh'h I 1 1 1 1 II II h ;hll 

Db 40 0 FBFGXI YGVGSYS KVVRAKKKBTGTVYALKI MDKKF I TKENKTA YVKLBR I VTJ5QLEH P 99 

Qy 141 FFVKLYFTFODDB KL Y FGLSYAKNGBLLKY I RK1 GS FDETCTRFYTAB I VSALBYLKG KG 200 

= 11111111 III : III I : I I lllllhl lllhl I 
Db 100 G 1 1 KLY FT FQOTSSL YMALE5 CEGGELFDQ I TRKGRLSBOBARPYTABWOALBY I HSMG 159 

Oy 201 1 1 HRDLKPENI LLNBDMHIOITDFGTAK VLS P ESKQARANS FVGTAQYVS PB 2S2 

Mllhlllhll I Ihl Mh I II «l :||||| II || 

Db 160 L I HRD I KPENLLLTSDGH I KI AD PGSVKPMQDSQ I TVLPNAASODKACTFVGTAA YVP PB 219 

Oy 253 LLTEKSACKSSDLMALGCI IY0LVAGLPPFRAGNBYLIFQKIIKLEYOFPBKFFPKAROL 312 

0 I Mlllll >l|i:i| lh 'hlllhll : II I I I II 
Db 220 VLNSS PATFGHD LXALGCTLYQMLSGTS P FKDASEWL I PQR 1 X ARD I KFPNH FSBAARDL 279 

Qy 313 VB KLLVLDATKRLGCEEKEGYG PLKAH P FFESVTWENLHQQTP PKLTAYLPA MSEDD 369 

-II : ::| I III || INI | |:|| llllll III |i 
Db 280 I ORLLDTB PSRR PGAGS - EGYVALKRHP FFNGVDWKNLRSOTP P KL - APOPASOTASPBR 337 

Qy 370 EDCYGNYDNLLSOFGCMOVSSSSSSHS - - - LSASDTGL PQRSG SN I EOY I HDLDSNSFBL 426 

•I «l« I h I i II h :| | I I II 
Db 338 DDTHGSPWN - LTH IG - DSLAT0NEGHSAPPT6SSSSGS I TRLAS IDSFDSR 306 

Oy 427 □ LQPS EDE KRLLLB KQAGGNFWHQFVB - NNL I LKMG PVD KRKGLFARRROLLLTEGPHL Y 485 

I Ihl I I ■ II::::: Ihll I I 
Db 387 WQQPLE PGBSVLM I SA VKKLQKI TSKKVQL I LTNKPKL I 425 

Qy 486 YVDPVNXVLKGBIPW- - -SQELRFEAKNFKTFPVHTPNRTYYIXDPSGNAHKWCRKIQ 540 



4 



III! hll I! hi » I i II i I I I : I : 

Ob 426 YVDPSKLWXENI IWSDNSNDLXfiA/VTSPSHFKICTPKKVI^FEDAKQRASVWKKAIB 483 



RESULT 3 
SS1B99 

probable protaln kinase HRC1081 (EC 2.7.1. -) - yea at ( Saccharomyces cerevieiee) 
N; Alternate names: protein 00784; protein YOLlOOw 
C; Species: Saccharomyces cerevisiae 

C;Date: 05-Mey-199S »sequence_revision 03 -Aug- 1995 #text_change 05-Oct-2004 
C; Accession: S51899; S59175; S66796 

R;Vandenbol, H.j Durand. P.; Portetelle, D.j Hilger, F. 
submitted to the EKBL Oata Library, January 199S 

A; Description: Sequence analysis of a 44kb DNA fragment of yeast chromosome XV 

including the Tyl -H3 retrotransposon, the sufl(»> frameshift suppressor gene for 

tRNA-Gly. the yeast transfer RNA-Thr-la and a Delta. 

A;Reference number: S51B4B 

A;Accession: S51899 

A; Molecule type: DMA 

A; Residues: 1-1081 <VAN> 

A; Cross-references: UNIPROT:012236; UNI PARC: UPI 000004F9FC; EMBL:Z48149; 

NID:g663234; PIDN: CAA88162 . 1; PID:g663254 

R;Vandenbol, M. ; Durand, P.; Portetelle, 0.,* Hilger, P. 

Yeast 11, 1069-1075, 1995 

A; Title: Sequence analysis of a 44 kb ONA fragment of yeast chromosome XV 
including the Tyl -H3 retrotransposon, the sufl(*) frameshift suppressor gene for 
tRNA-Oly, the yeast transfer RNA-Thr-la and a delta element. 
A; Reference number: S591S6; MUID: 96076631; PMID:7S02582 
A;Aceeasion: SS9175 

A;Status: nucleic acid sequence not shown; translation not shown 
A; Molecule type: DMA 
A;Reaidues: 1-1081 <VAH> 

A; Cross-references: UNIPARC:UPI000004P9FC; EMBL:Z48149; NID:g663234; 
PIDN:CAA88162.1; PID:g663254 

A; Note: the nucleotide sequence was submitted to the EMBL Data Library, January 
1995 

R;Durand, P.; Hilger, P.; Portetelle, D. ; Vandenbol, M. 

submitted to the Protein Sequence Database, July 1996 

A; Reference number: S66791 

A; Accession: S66796 

A; Molecule type: DNA 

A; Residues: 1-10B1 <DUR> 

A; Cross-references: UNI PARC :UPI000004F9FC; EMBL:Z74B42; NID:gl419951 ; 

PIDN:CAA99113.1; PID:e251892; PID:gl419952; MIPS:YOL100w 

A; Experimental source: atrain S28SC 

C; Genetics: 

A; Gene: SGD:PKH2 

A; Cross-references: SGD:SOO0S46O 

A; Map position: 15L 

C; Keywords: ATP; phosphotransferase 

P;177-443/Dotnain: protein kinase homology <KIN> 

P;185-193/Region: protein kinase ATP-binding motif 

Query Match 25.51; Score 748. 5; DB 2; Length 1081; 

Best Local Similarity 38.1%; Pred. No. 3.2e-24; 

Matches 164; Conservative 79; Mismatches 109; Indels 79; Gaps 11; 



Oy 76 KKRPEDPKFCKI UJ EG 5 FSTWLARBLATS REYA I K.I LSKRH 1 1 KKNKVPYVTRKRDVMS 135 

II Mill ::hh:H|:|| = I t II hi I n|n II |h h 

Db 173 KKDI KDFKFGSVIGDOAYSTVMLATSIDTKlOlYAAKVLNXBYLIRQKKvXYVS IEKTALQ 232 

Oy 136 RLDH - PFFVXLYFTFQDOEKLYFGLSYAKNGBLLKY I RKI GS FDBTCTRPYTAB I VSALS 194 

«|m I hh lllh III I II Ih I ::| II I 1 1 1 hi hh l-« 
0b 233 IG^SPSVVRLFSTFODESSLYPTXBYAPNGDFLSLMKKYGSLDETCARYYAAOI IDAID 292 

Oy 195 YLHOKCI IHRDLKPEN1LLNEDMHI0ITDFQTAXVLSPB5K QARANSPVOT 245 

III lllllhllllllh i| I t I I : |: HIM 

Db 293 YLHSNGI IHRDIKPENILiJXJEMKIKLTDPGTAKUJJPTNNSVSKPEYDI^TRSKSFVGT 352 

Qy 246 AOYVSPELLTEKSACXSSDLMALGCI IYQLVAGLPPPRAGNEYLI PQKI I KLBYDFPBKF 30S 

hlllllll i hll llh'h'll llhl II 1 1 MhM'M I I 

Db 353 AEYVSPBLLNDSPTDYRCDIHAPOCI LFQMIAGKPPPKATNEYLTFOKVMKVOYAPTPGF 412 

Oy 306 F P KARDLVE KLLVLDATKRIjGCEEMEGYG PLKAH P FPESVTVB - - NLHQQTP P KLTA Y - L 362 

lllhhll ■■ ■■ I I H I Ih i - Mlh = I i 

Db 413 PL 1 1 RDLVKKI LVKNLDRRLTI SO I KEHHPP KDLNPKDGSVWS KT P PE I KPYKI 466 

Oy 363 FAMSBDDEDCYGNYDNLLSQFGOtQVSSSSSSHSL- - -SA5DTG LPQRSGS- -NIB 413 

II II I I I I : I : III I -I 

Db 467 NAKS MQAMPSGSDRKL vXKSVNTLGKSHLVTQRSASS PSVE 507 

Qy 414 QY I HDI^SNSreLCUJFSEDEKRU^KOAGGNPVfHQPVENNL I LKMGPVDK 465 

: I I I I h : =1 I h 

Db SOB BTTHSTLYN NNTHASTESEISIKXRPTDERTAQILEN 544 

Qy 466 -RKGLFARRR0 475 

llh h I 
Db S45 ARKGINNRKNQ 555 



RESULT 4 
T404B6 

phosphoinositide-dependent protein kinase 1 (EC 3.7.1.-) SPBC4C3.il (imported) - 
fission yeast (Schizoseccharomyces pombe) 
C; Species: Schizosaccheromyces pombe 

C;Dato: 2 0 -Oct -2000 »sequence_revision 20-Oct-2000 #text_change 09- Jul -2004 
C; Accession: T40486; T39692 

R;Wood, V.; Rajendream, M.A.; Barrel 1, B.G.; Lauber, J-; Hilbert, H.; 
Duesterhoeft, A. 

submitted to the EMBL Data Library. February 1998 

A; Reference number: Z21910 

A; Accession: T40486 

A; Status: preliminary 

A;Molecule type: ONA 

A; Residues: 1-420 <WOO> 

A; Cross-references : UNIPRQT:Q9Y7J6; UNI PARC : UPI 0000069B50 ; EMBL:AL021730; 
PIDN :CAA16B3 3.1; GSPDB :GN00067; SPDB : SPBC4C3 . 1 1 

R.Oliver, K. ; Harria, D. ; Hood, V.; Rajandream, M.A.; Barrel 1, B.C. 
submitted to the EMBL Data Library, March 1998 
A; Reference number: Z21B69 
A.Acceasion: T39692 

A;Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: DNA 

A; Residues: 385-550 <OLI> 



A; Cross-references: UNI PARC: UPI 000 006A777 ; BMBL:AL049489; PIDN:CAB3 9805 . 1 ; 
OSPDB:GN00067; SPDB :SPBC177B . 10c 

A; Experimental source: strain 972h-; eosmid C1778 
C; Genetics: 

A;Gene: SPBC4C3.il; SPDB : SPBC1778 . 10c 

A; Map position: 3 

C; Keywords : phosphotransferase 

Query Match 24.9%; Score 733; DB 2; Length SS0; 

Best Local Similarity 36.4%; Pred. No. 7.7e-24; 

Matches 185; Conservative 82; Mismatches 165; Indels 76; Gaps 16; 
Qy 77 KRPEDFKFGKILGEGSFSTWLARELATSREYAI KI LBKRHI IKSNKVPYVTRERDVMSR 136 

I I h II hhllll I I • i MHIhhhMMIIII II III I I 

Db 49 KSPADYVFGDI IGDGSPSKNHIRATDKKSWKEYAIKVI^KKYI VKENKVKYVNI KRDSMMR 108 



oy 



137 LD-HPFFVKLYFT FQDD E KL YFG LSYAKNG B LLKY I RKI GS FDBTCTRFYTAE I VSALEY 195 

h I =h Mill llh I I I 1 I I I I I I II I III Ilhh:|| 

109 LNG FPG I SRLFHT FQDD LKL YYVLELAPNGE LLQY I KKYRFLD BNCVRFYAAE I LS S I BY 168 



196 LHGKG I IHRDLKPENILLNBDMHIQITDPGTAKVLSPESKQAR - 
M lllllllllllll : :|h:|||||||hl h 

169 r 



• -ANSPVGTAQYVS 250 

MIMIhlh 

h 228 



Qy 251 PBLLTEKSACKSSDLHALGCI I YOLVAGLPPPRAGNEYLI PQKI I KLEYDFPEKFFPKAR 310 

lllh ■ lllllll |:>||>> I Ml I Ihlh llh h I 
Db 229 PELLSRQWSKSSDLWAFACVVYQMIVGSPPFHGSNPNNIFKKIMSLEYBLPKLLPPDIV 2B8 

Oy 311 DLvBKLLVLDATKRLGCBEMEGYG PLKAH PF FESVTWENLHQQTP PKLTAYLPAMSEDD B 370 

I I : " I Mill ::||:|| Ml « :: I 
Db 289 PLPSHLFRI QPSD RSTTQQ IKQFPFFATITWDNLWTQDPPPMQSFRP 335 

Qy 371 DCYGNYD NLLSQFGCMOVSSSSSSHSLSASDTGLP--QRS 408 

Ih I : I hi I i I II I 

Db 336 NYNIAIPNAPAYYRSNVTAAAAANAAAAF ASASIVKHOBTARRQBLPTVNRP 387 

Oy 409 GSNIEQYIH-DLDSNSFELDLQFSB DB KRLLLE KQA0GNFWHQPVENNL I LKMG PVD 464 

I : I h >l : « = - M II I 
Db 3B8 TAPTAHYOYASLIISHOMPVDRLYYKLVPSSESIIESTSVFVSPIPSVPEGNKF PSO 443 

0y 465 KRKGLFARRRQ- -LLLTE GPH - - LYYVDPvNKVLKGB I PWSQELR PEAKNF 511 

I : I II MM: |||: || || I : 

Db 444 LSK-MFLXRKORVMLLTDVGRCAFVCKGKHERLFIEMEVN- -LKDS SVWIFDENSS 497 

Qy 512 KTFPVHTPNRTYYLMDPSGNAHKWCRKI 539 

||: : | ||: |: II 

Db 498 KRFLIEDKVQSWI IEDSSGDVTKYKDKI 525 



RESULT 5 
S69657 

hypothetical protein YDR490c - yeast (Saccharomyces cerevisiae) 
C; Species: Ssccharomyces cerevisiae 

C;Date: 22-Aug-1996 ftaequenco revision 06-Sep-1996 Ittext change 05-Oct-2004 
C; Accession: S696S7 
R;Dietrich. P.S. 

submitted to the EMBL Date Library, August 1995 



A; Description : The sequence of S. cerevisiae coamids 9410, 8035, B166, and 9787. 
A; Reference number: S695S4 
A; Accession: S69657 
A; Molecule type: DNA 
A; Residues: 1-766 <DIE> 

A; Cross -references : UNIPROT:Q03407; UNI PARC :UPI000006A3C7; EMBL:U33050; 

NID:g927726; PIDN: AAB64917 . 1; PID:g927745; MIPS:YDR490c 

C; Genetics: 

A; Gene: SGD:PKH1 

A; Croaa -references: SGD:S0002898 
A; Map position: 4R 
C; Keywords: ATP 

F;123-391/0omain: protein kinase homology <KIN> 
P;131-139/Rogion: protein kinase ATP-binding motif 



Query Match 24.2%; Score 712; DB 2; 

Best Local Similarity 30.3%; Pred. No. 7.7e-23 
Matches 187; Conservative 87; Mismatches 154; 



Length 766; 

Indels 190; Gaps 15; 



Qy 


27 


Db 


22 


Qy 


68 


Db 


79 


Qy 


96 


Db 


139 


Qy 


1S5 


Db 


199 


Qy 


215 


Db 


259 


Qy 


264 


Db 


319 


Qy 


324 


Db 


379 


Qy 


370 


Db 


433 


Qy 


373 


Db 


493 


oy 


430 



I llh 



I : I : I 



-PAMDGT- 
h I 



AQP - PPQPRKKRPB - ■ 

I MM I 



-AAEPRPGAGSLQH 67 
I II Ml 



• - DFKFGKI LGEGSPST 95 

llllh Ihlhh 



III! 



-IIMM I 



II III 



« Ihllllh 



III I II •!• I hi II MM hi =:h h> II lllllhlllllll 



M -MIIIIIIM I 



-ARANSPVOTAQYVSPELLTSKSACICSS 263 

h= lllllhlllllll I 



hll IIIMhMI llhl Mil MhM:M I I 



MIIMIII I 



NLHQQTPPKLTAY 

MM II I M :: Ih: I I Ml I 

I KAHLFFHEVNFEDGSVWDDNPPEIQPYKIKAEAMKPLQKVSBSDTTV 432 



• \ 



MMI h 



M II hi 



I : I 



Db 544 ERVLLMKBVALSTRNLEDS PVGLENVALDYKNP LDIBPPTDSAGK 568 

Qy 470 FARRRQLLLTEGPHLYYV 487 

I - h I I =1 
Db 569 PYKKMFL I TNLGRALVFV 606 



RESULT 6 
869634 

hypothetical protein YDR466w - yult (Saccharorayce* cereviaiae) 
C; Specie*: Saccharoraycea cereviaiae 

CjDate: 22-Aug-1996 #aoquence_reviaion 06-Sop-1996 #text_change 0S-Oct-2004 
C;Acceaaion: S69634 
R;Dietrich, P.S. 

aubraitted to the EKBL Data Library, Auguat 1995 

A;Deacription: The aequencs of S. cereviaiae coaraida 9410, 8035, 8166, and 9787. 
A; Reference number: S69554 
A;Acceaaion: S69634 
A; Molecule type: DNA 
A;Re*idue*: 1-898 <DIE> 

A; Cross-references: UNIPROT:Q03306; UNIPARC:UPI000006C12B; EMBL:U33050; 
NID:g927726; PIDN : AAB64902 . 1 ; PID:g927730; MIPS;YDR466w 
C;Genatica : 

A;Croaa-r«f erencea : SGD:SQ002874 
A; Hap position: 4R 
C; Keyword!: ATP 

F; 9-293/Domain: protein kinase homology <KIN> 
F; 17-2S/Region: protein kinase ATP-binding motif 

Query Hatch 22.1%; Score 64B.5; DB 2; Length 898; 

Beat Local Similarity 37.7%; Pred. No. 3.8e-20; 

Hatchea 149; Coneervative 60; Mismatches 109; Indole 77; Oapa 10; 

Oy 75 RKKRPEDFKFGKI LGEGSFSTWLARS - LATSREYAI KI LEKRH 1 1 KENKVPYVTRBRDV 133 

th I II i = II U NI | : : :: ||||: I t I I I I II III I- 
Db 4 RKRSPHOFI FKESL/2HGSYSTVFKAIJ3KKSPNKI YAI KVCSKKHI I KKAKVKYVTI EKNT 63 

Oy 134 MSRL DHPFFVKLYFTFQDDEKLYFGUSYAXNGELLKYIRKIGSFDETCTRFYTABIV 190 

h I I 1 1 1 1 1 hi III I . IMM- II = 

Db 64 MNLLAQKHHAG I 1 KLYYTFHDEENLYFVLDFAPGGELLSLLHKMGTFND I WTRH FTAQL I 123 

Oy 191 SALBYLKGKCI IHRDLKFENILLNEDMHIQITDFGTAKVLSP ESKQARA 239 

lll-l MIIIMIII II | . HIM | . | =| 
Db 124 DALEFIHSHOIIHRDLIO>ENVl^RDGRLMITDFGAAATIDPSr^GDSAKFNSDSNGSKD 183 

Qy 240 N SFVGTAQYVSPELLTEKSACKSSDLwALGCI I YQLVAGLPPFRAGNEYLI POKI I 295 

I lllllhlllllll Ihllllhlll I I Mil II I'll* 

Db 184 NQNCASFVOTAEYVS PELLLYNQOGYGSD INALGOI I YOFVOGQPP FRG ENE LKTFE KI V 243 

Qy 296 KLBYDF PEKF FP KARDLVB KLLVLDATKRLGCE EMEOYG P LKAH PFFESVT 346 

hi « I MIMMI- >|> h I I Ihl I 

Db 244 ALDYPWGPNNR1NNSTSPIMPLVINLVQKILVIEVNERISLEQ IKRHPYFSKVD 297 

Qy 347 HENLH QOT P PKLTAYLPAMSEDDEDCYGNYDNLLSQFGCMQV SSS 5 S SHS LSASDTGLPQ 406 

I h II I II : III 

Db 298 WN DKIKIWRGIWQ SQGQSLQQTTLGLP- 324 



Qy 407 RSGSNIEQYI HDLDSNSFBLDLQFSBDSK 435 

II I I | i|i i i i i :| 

Db 325 NIPQNILPTRQLHVIDTPARSIQITKQKRKK 355 



RESULT 7 
S63378 

hypothetical protein YNR047w - yeaat (Saccharoraycee cereviaiae) 
N;Alternate namea : hypothetical protein N3449 
C; Specie*: Saccharooycee cereviaiae 

C;Date: 27-Apr-1996 •*equence_rovi*ion 03-May-1996 * text_change 0S-Oct-2004 
C; Accession: S63378 
R;Pohl. T.H. 

submitted to the Protein Sequence Database, April 1996 
A; Reference number: S63346 
A; Accession: S63378 
A;Molecule type: DNA 
A; Residues: 1-893 <POH> 

A; Crosa- reference*: UNIPROT: P53739; UNIPARC:UPI000012DF15; SHBL:Z71662; 
NID:gl302SS7; PIDN:CAA96328 . 1; PID:e239836; PID:gl302SSB; MIPS:YNR047w 
A; Experimental aource: atrain S288C 
C; Genetic a: 

A; Cro*a - reference* : SGD:S0005330 
A; Hap poaition: 14R 
C; Keyword* : ATP 

F;494-777/Domain: protein kinase homology <KIN> 
F;502-510/Region: protein kinase ATP-binding motif 

Query Hatch 20.6%; Score 606.5; DB 2; Length 893; 

8e*t Local Similarity 31.7%; Pred. No. 2e-18; 

Matches 155; Coneervative 73; Hiamatchea 156; Indel* 105; Gap* 14; 

Qy 20 LCSCPSP- - •SMVRTQTESSTPPGIPGGSRQGPAMDGTAAEPRPGAGSLQHAQPP P 72 

I I II I :| h III I 11=11 I 

Db 421 LAS ATS PTSSSARKTSGS SN I ND K I PGQS VPPPNSFFPQBPSPKISDFP 469 

Qy 73 OPRKKR PEDFKFGKILGEGSFSTWLARELATSREYAI KI LBK 115 

:|h I h h -Ihl I I II hi Ihhl I 

Db 470 BPRRSRRLRTKS F SNKFQD I MVG PQS FE KI RLLGQGDVG KVFLVlUiiaCTNRVYALKVLSK 529 

Qy 116 RHI IKENIC^YVTRERDVMSRLDHPFFVKLYFTFQDDEKLYFGLSYAKIIGELLKYI - -RK 173 

:|| Ih I h-- Mil I II Ml -II : I II : : II 
Db S30 DEM I KRNKI KRVLTEQE I LATSNH PF I VTLYHS FQSEDYL YLCMEYCMGOB FFRALOTRK 589 

Qy 174 IGSFDSTCTRFYTABIVSALBYLHGKGI IHRDLKPBNILLNSDMHIQITDF 224 

I III :h Mlllll I IMIIIIIIIII- II -11 
Db 590 TKCI CEDDARFYASBVTAALBYLHLLGFI YRDLKPEN I LLHQSGH I KLSDFDLS I QAKDS 649 

Qy 225 GTAKVLS PBS KQA RANSFVGTAQYVSPBLLTBKSACKSSDLHALGCI I YQ 274 

hh -I I llllll M-ll- Mill Mh 

Db 650 KVPWKG SAQSTLVDTKI CSDGFRTNSFVGTEEY IAPEVI RGNGHTAAVDWMTLG I LI YB 709 

Qy 275 LVAGLPPFRAGNBYLI FQKI I KLEYDFP- -BKFFPKARDLVEfOlLVLDATKRLGCSBMBG 332 

- I Ih I I hi I II Ml-lll « Mlllh I 

Db 710 MLFG FTP FKJ3DNTNSTFTN I LKNBVSFPNNNS I SRTCKDL I KKLLTKNESKRLGCK- -HG 767 

Qy 333 YGPLKAH PF FE SVTWENLHQQT P P KLTA YLPAKS EDDEDCYGNYDNLLSQPGCHQVSS SS 392 



Db 

Qy 



Qy 

Db 



M lllh I I I I II 
768 AADVKKHPFFKKVQWSLLRNQSPP- 



M Mil III :| 
-LIPVLSEDGYDFAKLSSNKKRQ TSQ 816 



393 SSHSLSASDTGLPQRSGSNIEQYIHDLDSNSF BLDLQFSBDEKRLLLEKQAGGNPW 448 

II II Ih llh h 

817 DSH KHLDEQB KKMFBE RVEYDDEVSBDD PF 846 

449 HQFVENNLI 457 

I I =h 
847 HDFNSHSLM 855 



RESULT 6 
JQ1150 

protein kinase (EC 2.7.1.37) cAMP-dependent, catalytic chain - aliroe mold 

(Dictyoatelium dlacoideura) 

C; Specie*: Dictyoatelium diacoideum 

C;Dato: 31-Dec-1991 »sequence_revision 31-Dec-1991 8text_change 09-Jul-2004 
C;Acce*aion: JQ1150 

R;Buerki, S. ; Anjard, C; Scholder, J.C.; Reymond, CD. 
Gene 102, 57-65, 1991 

A; Title: Isolation of two genes encoding putative protein kinases regulated 

during Dictyoatelium diacoideum development. 

A; Reference number: JQ1150; MUID: 91323730 ; PMID: 1864510 

A; Accession: J01150 

A; Molecule type: DNA 

A; Residues: 1-648 <BUB> 

A; Cross-reference*: UNIPROT : P34099; UNIPARC:UPI000012DC1F; GB:M38703 
C; Genetic*: 
A; Gene: PK2 
Ajlntrons: 578/3 

C;Conplex: heterodimor with regulatory chain; active catalytic chain i* relea*ed 
when cAHP bind* to the regulatory chain 
C; Function; 

A; Description: catalyzes the formation of peptidyl-serine-phosphate or peptidyl- 
threonine -phosphate using ATP 

A; Note : important for cell type differentiation and fruiting body morphogenesis 
C; Supe r family: Dictyosteliura cAMP- dependent protein kinase catalytic chain; 
protein kinase homology 

C; Keyword*: ATP; magnesium; phosphoprotein; phosphotransferase; 

serine/ threonine- specific protein kinase 

F;126-223/Rogion: glutamine-rich 

F;297-312/Region: glutamine-rich 

P;334-590/Domain: protein kinase homology <KIN> 

P; 342-350/Region: protein kinase ATP-binding motif 

F;36S,384,459,461/Active site: Ly*. Glu, Asp, Ly* 0«tatus predicted 
F;464, 468/Binding aite: magnesium (Asn, Asp) tstatus predicted 



Query Match 19.3%; Score 568.5; DB 1; 

Best Local Similarity 36.7%; Pred. No. S.7e-17; 
Matches US; Conservative 68; Mismatches 113; 



Length 648; 



70 PPPQPRKKHPKDFKFGKI LGEGSFSTWIARELATSREYA I KI LEKRH 1 1 KENKVPYVTR 129 
III ::| m|| =:|| hi 11= Ihl I I -- 1 1 - 

324 PPPVNARSRLKEFKQ I RVI/TTGTFGKVYL IQNTKDGCYYAMKCLNKAYWQLKQVEHLHS 383 



I: -I : III I II Ill-Ill I Ih -II I ■ Ml III 
Db 384 SKSILSSIHHPFI VNL YQAFQDE KXL YLL FEYVAGG EVFTH LRKSMKF SNSTAKFYAAB I 443 

Qy 190 VSALBYLKGKGI IHRDLKPENILLNEDMHIQITDFGTAKVLSPBSKQARANSFVGTAQYV 249 

I llhll : hMIIIIIIM- Ihlllll II : = I II =h 

Db 444 VLALE FLH KQN I VYRDLKPENLL I DNQGH I KI TDFGFAKRV ED RT FTLCOTPEYL 498 

Qy 250 SPBLLTEKSACKSSDLHALGCI IYQLVAGLPPFRAGNBYLI FQKI I KLEYDFPBKFFPKA 309 

Ml- I h I Mil M--II III ■ h Ih II I I 

Db 499 APE I IQS KGHGKAVDWWAL/jILIFEMLAGYPPFYTJDDTFAI YNKI LAGRITFPLGPDVDA 558 



oy 



310 RDLVEKLLVLDATKRLGCEEMEGYG PLKAHPF FE5VTWENLKQQTP PKLTAYLPA 364 

MI-MI I hill : :| Mil : II hh Ih 
559 KDLI KRLLTADRTRRLGALK-DGAIX^TKNHRWFSDINVfERLYQRRDNGPFI PKI 611 

365 MSBDDEDCYGNYD 377 

. I • II 
612 QHQGDSSNPEMYD 624 



RESULT 9 
S68463 

protein kinase ATPK19 (EC 2.7.1.-) - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 05-Dec-1996 #sequence_revision 27-Feb-1997 «text_change 05-Oct-2004 
C; Accession: S68463 

R;Mizoguchi, T.; Hayashida, N. ; Yamaguchi-Shinozaki . K. ; Kamada, H.; Shinozaki, 
K. 

FBBS Lett. 358, 199-304. 1995 

A; Title: Two genes that encode ribosomal -protein S6 kinase homologs are Induced 
by cold or salinity stress in Arabidopsis thaliana. 
A;Reference number: S68462; MUID:95129712; PMID:7B28736 
A;Accession: S68463 

A; Status: nucleic acid sequence not shown 
A; Molecule type: DNA 
A; Residues: 1-471 <MIZ> 

A; Cross-references: UNIPROT: 03 9030; UNI PARC :UPI0 000 1 6DAA6 ; EMBL:D42061; 
NI0:gl526412; PIDN : BAA07661 . 1 ; PID:dl008243 ,- PID:g867995 

C; Keywords: ATP; phosphotransferase; serine/ threonine -specific protein kinase 
P;13B-395/Domain: protein kinase homology <KXN> 
F;146-lS4/Region: protein kinase ATP-binding motif 



Query Hatch 19.2%; 
Best Local Similarity 32.7%; 
Matches 128; Conservative 



Score 566; DB 2; Length 471; 
Pred. No. 5.4e-17; 
79; Mismatches 142; Indel s 42; 



oy 

Db 

oy 

Db 
Oy 
Db 



15 OSSVVTXSCPSPSMVTITOTBSSTPPOIPOGSRQGPAHDGTAAEPRP- 

Mill I I I I I I I 
96 EDSVDLVECVEGES I KBNDEFS GNDDTDSB KS PBBVSGWG 1 137 

71 PPQPRKKRPBDFKFGKILGEGSFSTVV1ARBLATSR8YAIKILBKRHI IKBKKVPYVTRB 130 

llh hMMM I h II Ihl- I h-l h I 

138 ED FBVLKWGQGAFG KVYQVRKKDTSB I YAMKVMRKDKI VE KNKAKYMKAB 188 

131 RDVKSRLDH PFFVKLY PTFQDOBKL YPGLSYAKNGELLKY I RXI GSFDBTCTRPYTAB I V 190 

llitti.HH hi -II Ml I i II « III I llllll 

189 RDILTKIDHPPI VQLKYS FQTKYRLYLVLDF I NGGHLFFQL YHQGLFREDLARVYTAB I V 248 



Oy 191 SALEYLKGKGI IHRDLKPBNIIJJJEDMHIQITDFGTAXVLSPSSKQARANSFVGTAQYVS 250 

I ■■ HI llhlllllllllh: I h :|llt II ■■ ill II :h: 

Db 249 eWSHI^BKDIKHRDLt^BNII>GJVDGKVKLTDPGLAKBF- - - EENTRSNSMCGTTBYMA 305 

Oy 251 PSLLTBKSACTCSSDLHALCCI I YQLVAGLPPFRAGiJEYLI FQKI I KLEYDFPEKFPPKAR 310 

II- I |::| |::| I III |n I III I i h hi 

Db 306 PB I VRG W2KDKAADWWSVG I LL YEMLTGKP P P • LGS KG KI QQKI VXD KI KLPQFVFNSAH 364 

Qy 311 DLVB KLLVLQAT KRLCCBHMEGYG PLKAK PPPBSVWENLH - QOTPP KLTAYLPAMSEDD 369 

|.i || : :||| | .| | : |'" l>l I - Ihl 

Db 365 ALLKGIXQKEPERRLG-SGPSGAEBIKKHKMFKAINWXKLEAREVQP- - -SFKPAVS- -G 418 

Oy 370 BDCTGNYDNLLSQFGCMQVSSSSSSHSLSAS 400 

I II :M : h 

Db 419 ROC I ANFD KCWTDMSVLDS PAS S PNSDAKAN 449 



RESULT 10 
T33662 

hypothetical protein W04B5.5 - Caenorhabditia elagana 
C; Species: Caenorhabditia elagana 

C;Date: 29-Oct-1999 »aoquence_revieion 29-Oct-1999 Btext_change 09-Jul-2004 

C;Accession: T33662 

R; Smith, A.; Gibson, A. 

submitted to the EMBL Data Library. October 1998 

A; Description: The sequence of C. elegane cosrald W04B5. 

A; Reference number: Z2138S 

A;Aceession: T33662 

A; Status: preliminary; translated f rota GB/EHBL/DDBJ 
A; Molecule type: DMA 
A;Residues: 1-320 <SMI> 

A; Cross-references ; UNI PROT : 09UA62 ; UNIPARC:UPI00000753FD; EMBL : AF100305 ; 

PIDN:AAC6B917.1; GSPOB:GN00021; CBSP:W04B5.S 

A; Experimental source: strain Bristol N2; clone W04BS 

C; Genetics: 

A; Gene: CBSP:W04B5.5 

A; Hap position: 3 

A.Introns: 27/3; 71/3; 117/1; 149/1; 218/3; 287/3 

C;Superfamily: kinase- related transforming protein; protein kinase homology 

Ouery Match 19.2%; Score 565. S; DB 2; Length 320; 

Beat Local Similarity 40.8%; Pred. No. 4e-17; 

Matches 129; Conservative 51; Mismatches 99; Indels 37; Gaps 7; 

Oy 74 PRKKRPE DPKFGKILGEGSFSTWLARSLATSR EYA1KILEKRHI IKE 121 

h || hi Ill Ml I hll II hi 

Db 10 PQNARREGSNMKClADYSVEIGiVGEGSPSTVYLARRKEPKNDENPEVALKI CLIGIL I LKN 69 



Oy 122 NKVPrVTRERDVMSRLD HPFFVKLYFTFQDDEKLYFGLSYAKNGELLKYIRK- - 1 174 

Mh lh= II I II I I I I I III I III hi ::| 

Db 70 KMVP Y I H RE KSALAL I S REENAH PG I VTLYAT P0DSBS L YFVLSYAKYGDLCTLMOKQ PD 129 

Qy 175 GSFDETCTRFYTAEIVSALEYLKGKGI IHRDLKPENILLNBDMHIQITDFGTAKVLS- - - 231 

h =hl I > - 1 I I I « - I M I II I : I :hh I I :UlhH II 
Db 130 SKFNVADSRYYAANLLSALEHIHKLGI IHRDVKADNLLVKSDGRIMLTDFGSSKFLSDYQ 189 



Oy 232 PESXOARANS7VGTAOYVSPBLLTBKSACKSSDLMALGCI I YQLVAGLPPPR 2B3 

I I =111111 UMIII llllll =1 t h II 

Db 190 KI QENPVESSQPTGRRSSFVGTAFFVTPEIJiTGSEMSPSSDLHAFSVTLYLPLTO I YPFN 249 

Qy 284 AGNBYL I FQKI I KLEYDPPEKP- FPKAFJ3LVEKlXVLDATKRL<OCEEMEGYQPLKAHPFF 342 

'lll'h'l Mill Mhhlll : II :| :| I II 

Db 250 DMSBYLVFRR IQDILYTFSED FPDENAXNL I BRLLVKEQKSRLTSQB IKEHKFF 303 

Oy 343 BSVTWBNLHQOTPPKL 358 

Ih >hl I ll>> 
Ob 304 ESIDFEHLEQLEPPRI 319 



RESULT 11 
S68462 

protein kinase ATPK6/ATPK1 (BC 2.7.1.-) - Arabidopsis t ha liana 
C; Species: Arabidopsis thai i ana (mouse -ear cress) 

C;Date: OS-Dec-1996 8aequence_rovision 13-Msr-1997 8text_change 05-Oct-2004 
C;Accession: S68462; A54141 

R;Mizoguchi, T. ; Hayaehida, N. ; Yemaguchi -Shinozaki , K. ; Kamada. H.; Shinozaki, 
K. 

PEBS Lett. 358, 199-204, 1995 

A; Title: Two genes that encode ribosoraal -protein 66 kinase homologs are induced 
by cold or salinity stress in Arabidopsis tha liana . 
A;Reference number: S68462; MUID: 95129712; PMID:7B28736 
A;Accession: S68462 

A; Status: nucleic acid sequence not shown 
A; Molecule type: mRNA 
A; Residues: 1-46S <MI2> 

A; Cross -references: UNI PROT : P42818 ; UNIPARC:UPI000000178F; EKBL:D42056; 
NIO:g867996; PIDN:BAA076S6 . 1 ; PID:dlOOB238; PID:g867997 
R; Zhang, S.H.; Lawton, M.A.; Hunter, T.; Lamb, C.J. 
J. Biol. Chem. 269, 17586-17592, 1994 

A;Title: atpkl, a novel ribosomal protein kinase gene from Arabidopsis. I. 

Isolation, characterization, and expression. 

A;Reference number: A54141; MUID:94292S19; PMID:7912697 

A; Contents: ecotype Landsberg erecta 

A; Accession: A54141 

A; Status: preliminary 

A; Molecule type: DKA 

A; Residues: 1-465 <ZHA> 

A; Cross -references: UNI PARC ;UPI 000000 178F; GB:L29030; NID:g50B307j 
P1DN:AAA21142.1; PID:g50830B 

A;Nota: sequence extracted from NCBI backbone (NCBIN: 149344, NCBIP:149415) 
C; Keywords: ATP; phosphotransferase; serine/ threonine-specif ic protein kinase 
F;132-3B9/Domain: protein kinase homology <KJN> 
F;140-148/Region: protein kinase ATP-binding motif 

Query Match 19.2%; Score 565.5; DB 2; Length 465; 

Beet Locsl Similarity 35.8%; Pred. No. 5.6e-17; 

Matches 115; Conservative 75; Mismatches 122; Indels 9; Gaps 5; 

Qy 80 ED PKFGK I LGEG S FSTWLARELATS REYA I KI LE KRH 1 1 KENKVP YVTRERDVMSR1J>H 139 

:||: |::|:|:| | |: || ||:|:: | ||:::| |: |||:::::|| 
Db 132 DD FEVMKVVG KGAFGKVYOVRKKETSB I YAMKVKRKDH I ME KNHAEYMKAERD I LTKI DH 191 

Qy 140 P FPVKLYFTFQDDE KL YFG LSYAKNGELLKY I RKI QS FDETCTR FYTAE I VSALBYLHO K 199 



II hi ::H :M I : II III I lllllllh : ll I 

Db 192 PF I VQLKYS FQTKYRLYLVLD F I NGGHLF FQLYHQGLFREDLARVYTAE I VSAVSH LH E K 251 

Oy 200 GI IHRDLKPENILLNEDMNIQITDFGTAKVl£PESKQARANSFVGTAOTVSPEIXTEtCSA 259 

fhlimilllh: I h MM II : hll II =h:|h: I 

Db 252 G I MHRDLKPEN I IXDTDGHVMLTD FGLAKEF EENTRSNSMCGTTEYMAPE I VRGKGH 30B 



Qy 

Db 

oy 

Db 

oy 

Db 



260 CKSSDLHALGCI IYQLVAGLPPFRAGNEYLIFOKIIKLEYDFPEKFFPKARDLVEKLLVL 319 

|::| |::| ::|::: | ||| |:: | |||:| : |: :| ::: || 

309 D KAADWW SVG I LLYEMLTG KPPP-LGSKGKIQQKI VKD KI KLPQFLSNEAHA I LKGLLQK 367 

320 DATKRLG CEEKECTG P L KAH P F FE SVTWENLH QQT P P KLTAYLPAMSEDDEDCYGNYDNL 379 

368 EPERRLO-SGLSGAEEIKQHKHFKGINWKKL- -EAREVMPSFKPEVS- -GRQCIANFDKC 422 

3 B0 LSQFGCMQVSSSSSSHSLSAS 400 

: :|| I h 
423 WTDMSVLDSPASSPSSDPKAN 443 



RESULT 12 
A48094 

serum and glucocorticoid- regulated kinase - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 21 -Jan- 1994 «sequence_revieion 18 -Nov- 1994 #text change 31 -Dec- 2004 
C;Acceasion: A4B094 

R; Webster, M.K.; Goya, L. ; Ge, Y. ; Maiyar, A.C.; Pireetone, G.L. 
Mol. Cell. Biol. 13, 2031-2040. 1993 

A; Title: Characterization of sgk, a novel member of the serine/ threonine protein 
kinase gene family which is transcriptionally induced by glucocorticoids and 
serum. 

A; Reference number: A48094; MUID: 93204949; PM1D:B4S5596 

A; Access ion: A4B094 

A; Status: preliminary 

A; Molecule type: nucleic acid 

A; Residues: 1-431 <HEB> 

A; Cross -references: UNI PARC :UPIOO0017A3EB 

A; Experimental source: ConB.hd6 mammary epithelial tumor cells 

A;Note: sequence extracted from NCBI backbone (NCBIN: 127618, NCBIP: 127619) 

C; Keywords: ATP 

F; 96-355/Domain: protein kinase homology <KIN> 
F;104-112/Region: protein kinase ATP-binding motif 



Ouery Match 19.1%; 
Beat Local Similarity 37.0%; 
Matches 125; Conservative 6 



Score 562.5; DB 2; 
Pred. No. 7e-17; 
I; Mismatches 134; 



Length 431; 



22 SCPSPSMVRTQTBSSTPPGI PGGSROGPAMDGTAAEPRPGAGSLQHAQPPPQPRKKRPBO 81 

:| | h: M I I I ■ M : I I III 

48 ACKH PB - VQSYLKI SQP OEPBLMNANPSPPPSPSQQINLGPSSNPHAK-PSD 97 

62 FKFGKILGEGSFSTVVLARBLATSRBYAIKILBKRH I IKENKVPYVTRBRDV-KSRLDHP 140 

I I h = hlll hill I Ihhhh hh • Ihl i : II 

98 FHFLKVIGKGSFGKVLIARHKAERAFYAVKVLQKKAILIQCK^ 157 

141 FFVTO- YFTFQDDBKLYFGLSYAKNQBLLKY I RKI GS FDETCTR PYTAB I VSALBYLHG KG 200 
I I hhll Mill I I III I I III III III III 



158 FLVGUi FS FQTAD KLY FVLDY I NGGELFYHLQRERCFLB PRARFYAAS I ASALGYLH S LN 217 

201 IIHRDLKPENI LLNEDMH 1 Q I TOFGTAKVLS PES KQARANS FVGTAQYVS PELLTEKSAC 260 

IMIIIIIIIIh II Mill I -I II :|::||:| :: 

218 IVYRDLKPBNILLDSQGHIVLTDFGLCK- -ENIEHNGTTSTFOGTPBYtAPEVLHKQPYD 275 

261 KSSDLWALGC I I YQLVAGLPPFRAGNEYLI FQKI I KLBYDFPSKFFPKARDLVEKLLVLD 320 

276 RTvUcLGAVLY^^ 335 



oy 



321 ATICRLGCEEMEGYGPLKAHPPFESVTWBNLHQQ--TPP 356 

Mil » : : :hl II . h>l « III 
336 RTKRLGAK--DDFMBIKSHIPFSLINWDDLINKKITPP 371 



RESULT 13 
S000B5 

protein kinase (EC 2.7.1.37), cAMP -dependent , type alpha, catalytic chain - pig 
(fragment) 

N; Alternate names: cAMP- dependent protein kinase C-alpha chain 
C; Species: Sus scrofa domestics (domestic pig) 

C;Date: 07-Sep-1990 Ssequence_revlsion 07-Sep-1990 8text_change 09-Jul-2004 
C; Accession: S000B5 

R;Adavani, S.R.; Schwarz, M.j Showers, M.O.; Maurer, R.A.; Hemmings, B.A. 
Eur. J. Biochem. 167, 221-226, 1987 

A; Title: Multiple mRNA species code for the catalytic subunit of the cAMP- 
dependent protein kinase from LLC-PK(l) cells. Evidence for two forms of the 
catalytic subunit. 

A;Reference number: S0O0BS; MUID : 87304 251 ; PMID:2441988 
A;Accession: S00085 
A; Molecule type: mRNA 
A; Residues: 1-338 <ADA> 

A; Cross -references: UNIPROT: P05383; UNIPARC:UPI000012DC1B; EMBL:X0S99B; 

NID:gl910; PIDN:CAA29415 . 1 ; PID:gl911 

A;Note: 48-Thr, 86-Met, and 96-Tyr were also found 

C;6uperfamily: kinase-related transforming protein; protein kinase homology 
C; Keywords: ATP; cAMP binding; heterotetramer; phosphoprotein; 
phosphotransferase; serine/ threonine- specific protein kinase 
F; 29-2BS/Domain : protein kinase homology <KIN> 
F;37-45/Reglon: protein kinase ATP-binding motif 
F;60/Active site: Lys "status predicted 

P; IBS/Binding site: phosphate (Thr) (covalent) flstatus predicted 
F; 326/Binding site: phosphate (Ser) (covalent) latatua predicted 

Query Match 19.1%; Score 561.5; DB 2; Length 338; 

Best Local Similarity 37.0%; Pred. No. 6.2e-17; 

114; Conservative 65; Mismatches 118; Indels 11; Gaps 5; 

72 PQPRKXRPBDPKFGKI LGEGSFSTWLARBLATSRSYAI KI LB KRH 1 1 KBNKVPYVTRSR 131 

I I 111= I II III hi ■ II : Ihllhh -I h 

21 PAPNNAGLEDFERKKTLGTOS FG R VMLVKH KATEQYYAMK I LD KQ KWKUCQ I EHTUJEK 80 

132 DVMSRIDH P P FVKL Y FTFQDD E KLY FGLSYAKNG ELLKY I RKI GS FDBTCTRPYTAB I VS 191 
- II hi hhh II ■■ I Ih -hll I I III hll 
81 R I LGA VNFPFL VRLE FSFKDNSNL YMVMBYVPGGEMFSHLRR I GRFSBPHARFYAAO I VL 140 



Qy 



192 ALBYLHGKGI IHRDLKPENILLNBDMHIQITDFOTAKVLSPESKQARANSPVXTTAQYVSP 251 



Mil =1=1111111=1== =11=1111 II = =1=11 =l==l 

Db 141 TFBYLH S LOL I YRDLKPENLL I DHQOY 1 OVTDFC FAKRV KGRTWTLCGTPBYLAP 195 

Qy 252 ELLTEKSACKSSDLHALGC1 I YQLVAGLPPFRAGNEYLI FQKI I KLEYDFPBKFPPKARD 111 

l== I |: I llll =ll== II III I l==ll= • II I =1 

Db 196 K 1 1 LS KGYNXAVDVWALGVL I YEHAAGYPPPFADQP I Q I YSKI VSG KVRFPSHFSSDLM3 255 

Qy 312 L VB KLL VLDATKRLG CE EMEGYG P LKAH PFFESVTWENLHQQTPPKLTA - YLPAM - SEDD 369 

I II :| III | | =1 I =1 = I = = h h I = = | I 
Ob 256 IXRNLLQVDLTKRFG - NLKNCVSD I JCTHKXFATTDW IAIYQR KVEAPP I PKFRGSGD 311 

Oy 370 BDCYGNYD 377 

: :|: 

Db 312 TSNFDOYE 319 



RESULT 14 
138121 

protein kinase - human 

C; Species: Homo sapisns (man) 

C;Date: 29-May-1998 «eequenco_revieion 29-Hay-199S »text_change 09-Jul-2004 
C; Access ion: 138121 

R;Klink, A.; Schiebel, K. ; Winkelmann, M. ; Rao, B . ; Horsthecke, B . ; Ludecke, 
H.J.j Claussen, U.; Scherer, O.j Rappold, G . 
Hum. Mol. Genet. 4, 669-678, 1995 

A;Title: The human protein kinase gene PKX1 on Xp22.3 displays Xp/Yp homology 

and is a aite of chromosomal instability. 

A; Reference number: 138121; KUID: 95360006 ; PHID: 7633447 

A;Acceasion: 138121 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-358 <RES> 

A; Cross -references: UNIPROT: PS1B17; UNI PARC : UPIO00000D998 ; EMBL: X8S54S; 
NID:gl052736; PIDN:CAAS9733 . 1; PID:gl052737 
C; Genetics: 
A; Gene : PKX1 

C;Superfamily: kinase- related transforming protein; protein kinase homology 
F;47-303/Domain: protein kinase homology <KIN> 

Query Match 19.1%; Score 560.5; DB 2; Length 358; 

Best Local Similarity 36.61; Pred. No. 7.2o-17; 

Matches 123; Conservative 63; Mismatches 119; Indels 31; Gaps 6; 

Oy 37 TPPGIPGGSRQGPAMIOTAAEPRPGAGSLOHAQPPPQPRKKRPEDFKFGKILGEGSFSTV 96 

II I I I I I I I 1=1 =11 =1 1=1 I 

Db 23 TPDGAP ALCPSPBALS PE PPVYSLQD FDTLATVGTGT FGRV 63 

Oy 97 VLARBLATSREYAI KILEKRH 1 1 KENKVPYVTRBRDVMSRLDHPFFVKLYFTFQDDBKLY 156 

I =1 >|>|>i =1= = =1 1= 1= = III = = l = = l= 1= II 

Db 64 HLVKEKTAKHPFALKVMS I PDVI RLKQEQHVHNE KSVLKEVSHPFL I RLFWTWKDERFLY 123 

Qy 1S7 FGLSYAKNG BLLKYIRKIGS FDETCTRFYTAB I VSALEYLHG KO 1 1 HRD LKP EN I LLNED 216 

= I III 1=1 II I I I = I I I = hllll I |< = I I I I I I I I I I = I 
Db 124 MLMEYVPGGELFSYLRNRGRFSSTTGLFYSAE 1 1 CAI EYLHSKE I VYRDLKPENI LLDRD 183 

Qy 217 MHIQI TDFGTAKVLS PES KQARANS FVGTAQYVS PKLLTE KSACKS SDLWALGC 1 1 YQLV 276 

ll-IIM II I I = II =! = = !! = = I == I llll =! = = = = 



Db 


184 


GHIKLTDFGFAKKLVD RTWTXOGTPBY1APBVIQSKGHGRAVDHWAL01LIFEML 238 


oy 


277 


AGLPP FRAGNEYX. I FQKI I KLEYD FPB KPFPKARDLVSKLLVLDATKRLGCSSMEGYG PL 336 




= 1 III 1 I 1 = 111= = III = ll = =IHhl hill 1 = 


Db 


239 


SGFPP FFDDNPFG I YQKI LAGKI DFPRHLDFHVKDLI KXLLWDRTRRLO -NMXNCANDV 2 97 


oy 


337 


KAHPFFESVTWENLHQQ--TPPKLTAYLPAMSEDDB 370 




i m ii in i= ii =i == i = 


Db 


298 


KHHRWFRSVDWBAVPQRKLKPP IVPKIAGDGD 329 



RESULT 15 
OXHYCB 

protein kinase (EC 2.7.1.37), cAMP -dependent, beta catalytic chain - Chinese 
hamster 

C; Species: Cricetulus griseus (Chinese hamster) 

CjDate: 31-Dec-1993 ftsequence^reviaion 31-Dec-1993 «text_change 09-Jul-2004 
C; Accession: A40384 

R.Howard, P.; Day, K.H.; Kim, K.B.; Richardson, J.; Thomas, J.; Abraham, I.; 
Fleischmann, R.D.; Ootteeman, M.M.; Maurer, R.A. 
J. Biol. Chem. 266, 10189-10195, 1991 

A; Title: Decreased catalytic subunit mRNA levels and altered catalytic subunit 
mRNA structure in a cAMP-resistant Chinese hamster ovary cell line. 
A;Reference number: A40384; KUID: 91244783; PMID:1645343 
A; Accession: A40384 
A; Molecule type: mRNA 
A; Residues: 1*351 <K0M> 

A; Cross •references : UNIPROT:P05206; UNIPARC:UPIO000OO40BP; GB:M63312; 
NID:gl91176; PIDN:AAA37011 . 1; PID:gl9U77 

C; Comment: The inactive enzyme contains two regulatory chains and two catalytic 
chains. Activation by cAMP produces two active catalytic monomers and a 
regulatory dimer that binds four cAMP molecules. Two types found in mammalian 
tissue are distinguished by having either type I or type II regulatory chains. 
C; Comment: Both alpha and beta catalytic chains are found in many tissues, with 
the alpha form being the more abundant. The highest concentrations of the beta 
form are found in brain. 

C; Super family: kinase -related transforming protein; protein kinase homology 
C; Keywords: ATP; blocked amino end; cAMP binding; hetorotetramer; lipoprotein; 
magnesium; myristylation; phosphoprotein; phosphotransferase; ser ine/ threonine - 
specific protein kinase 

P;2-351/Product: protein kinase. cAMP -dependent, beta catalytic chain tfetatus 
predicted <MAT> 

P;42-298/Domain: protein kinase homology <KIN> 
P;50-S8/Region: protein kinase ATP-binding motif 

F;2/Modified site: myristylated amino end (Gly) (in mature form) ftstatus 
predicted 

P;3/Modified site: aspartic acid (Asn) ftstatus predicted 

P;S5,56,122,12B,171,1B4/Binding site: Mg-ATP (Phe, Gly, Glu, Glu, Olu, Thr) 
ftstatus predicted 

F; 73, 92, 167, 169/ Active site: Lys, Glu, Asp, Lys ftstatus predicted 
F; 172, 185/Binding site: magnesium (Asn, Asp) tfetatus predicted 
F; 198/Binding site: phosphate (Thr) (covalent) ftstatus predicted 
F;339/Binding site: phosphate (Ser) (covalent) ftstatus predicted 

Query Match 19.0%; Score 55B.5; DB 1; Length 351; 

Best Local Similarity 36.7%; Pred. No. 8.5e-17; 

Matches 113; Conservative 66; Mismatches 118; Indels 11; Gaps 5; 



Qy 73 PQPRKKRPEDFKFGKI LGEGSFSTVVLARELATSREYAI KI LKKRH 1 1 KENKVPYVTRER 131 

I I llh I II III hi = II = Ihllhh = = l == = h 

Db 34 P P PSNAG LEDFERKKTLGTGS FGRVMLVKH KATEQYYAMK I LDKQKWKLKQI EHTLNEK 93 

Oy 132 DVMSRLDH P FFVKL Y FT FQDDB KL Y PGLSYAKNGELLKY I RKI GS FDETCTR FYTAE I VS 191 

= = == II 1 = 1 = = hh II = I Ih = = hll I I III hll 

Db 94 R I LQAVE PP FLVRLBYS FKDNSNL YMVMBYVPGGEM FSHLRR I GRFSE PHAR FYAAQ I VL 153 

Oy 192 ALEYLKG KO 1 1 HRD L KP EN I LLNBDMH I Q I TD PGTAKVLS PES KQARANS FVGTAQYVS P 251 

llll =hllHI1hh: =11 = 1111 II = = I = II :h=l 

Db 154 TFEYLHSLDL I YRDLKPENLL I DHQGY I QVTDFG FAKRV KG RTWTLCGTPBYLAP 208 

Oy 252 BLLTE KSACKSSD LMALGC 1 1 YQLVAG LP P FRAGNBYL I FQKI I KLBYD F PE KFFP KARD 311 

l== 1 h I llll =11 = : II III | h = lh : II I =1 

Db 209 EIILSKGYNKAVDWWALGVLIYEMAAOYPPFFADQPIQIYEKIVSGKVRFPSHFSSDLKD 268 

Oy 312 LVE KLLVLDAT KRLGCEEMEGYG PL KAHP F FE SVTWENLHQQTPP KLTA - YLPAM - SBDD 369 

h II =1 MM I :| | :| : | ::|: 1= I = = l I 
Db 269 LLRNLLQVDLTKRFG-NLKNGVSDIKTHKMFATTDWIAIYQR KVBAPPI PKFRGSGD 324 

Qy 370 BDCYGNYD 377 

: :|: 

Db 325 TSNFDDYE 332 



Search completed: June 27, 2006, 01:37:05 
Job time : 44 sees 



Gen Co re version 5.1.9 
Copyright (e) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 



Title: 

Perfect score: 
Sequence : 



June 27, 2006, 01:48:57 ; Search time 

(without alignments) 

66S.533 Million cell updates/sec 

US-10-689-576-1 
2941 

1 MARTTSQ L YDA VP IQSSWL RK I QEVWRQ R YQSH PDAAVQ 556 



Scoring table: BLOSUM62 

Oapop 10.0 , Gapext 0.5 

Searched: 99297 seqs, 22B45552 residues 

Total number of hits satisfying chosen parameters: 99297 

Minimum 08 seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Publiahed_Applications_AA_New : • 

1 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US09_NEW_PUB. pep : * 

2 : /EMC~Celerra_SIDS3/ptodata/2/pubpaa/US06_NEW_PUB.pep: • 

3 : /BMC_Celerra~SIDS3/ptodata/2/pubpaa/US07_NEW_PUB.pep:* 

4 : /EMC_Celerra_SIDS3/ptc<lota/2/pubpao/US08~NEW_PUB.pep: * 

5 : /SMC~Celerra_SIDS3/ptc-data/2/pubpaa/PCT_NEW_PUB . pep : * 

6 : /EMC_Celerxa_SIDS3/ptodata/2/pubpaa/US10_NEW_PUB. pep : * 

7 : /EMC~Celerra~SIDS3/ptodata/2/pubpaa/USll_NEW~PUB • P«P : * 

8 : /EMC~Celerra_SIDS3/ptodata/2/pubpaa/US60_NEW_PUB . pep : • 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Query 

Score Match Length OB 



1 


1367 


46. S 


687 


6 


US-10-449 


-902-53930 


Sequence S3 930, 


A 


2 


812 


27.6 


498 


6 


US-10-449 


-902-51984 


Sequence 51984, 


A 


3 


736.5 


25.0 


417 


6 


US-10-449 


-902-33335 


Sequence 33335, 


A 


4 


561 


19.1 


431 


6 


US-10-SOS 


92B-845 


Sequence 845, A 


PP 


5 


561 


19.1 


431 


6 


US-10-472 


622A-2 


Sequence 2, Appli 


6 


558 


19.0 


480 


6 


US-10-449 


-902-50842 


Sequence 50842, 


A 


7 


551 


18.7 


340 


6 


US-10-953 


349-17750 


Sequence 17750, 


A 


8 


SSI 


18.7 


418 


6 


US-10-9S3 


349-17749 


Sequence 17749, 


A 


9 


551 


18.7 


479 


6 


US-10-953 


349-17748 


Sequence 17748, 


A 



550. 5 


18 


7 


351 


6 


US-10-50S-92B-658 


Sequence 


658, App 


544.5 


IB 


5 


693 




US-10-449- 902-37629 


Sequence 


37629, A 


544 


18 


5 






US-10-449- 902-41431 






537.5 


IB 


3 


4S5 




US- 10 -449-902-44100 


Sequence 


44100* A 


526.5 


17 


9 


706 




US- 10 -511 -937-2450 


Sequence 


2450 An 


512.5 


17 


4 


556 




US- 10 -44 9- 902 -54017 


Sequence 


54017 A 


513.5 


17 


4 


685 




US-10-449- 903 -37624 




3 _ * 


512 


17 


4 


480 


£ 


US- 10-511-814-6 


Sequence 


6 Appli 


512 


17 


4 


480 


7 


US- 11 -289- 102-325 


Sequence 


325, App 


509 


17 


3 






US- 10 -44 9-902 -44738 


Sequence 




508.5 


17 


3 


544 


7 


US- 11 -293 -697-3334 


Sequence 


3334 B 'ap 


502 


17 


1 


733 




US- 10-505-926-349 


Sequence 


349, App 


490 


16 


7 


630 


7 


US- 11 -293 -697-3776 


Sequence 


3776, Ap 


478. S 


16 


3 


310 




US- 10 -449- 902-33480 






478.5 


16 


3 


451 


5 


US- 10-449-902 -47001 


Sequence 


47001, A 


474.5 


16 


1 


451 




US -10 -449 -902 -44457 


Sequence 


44457, A 


468.5 


15 


9 


498 


6 


US -10 -953 -34 9- 3867 


Sequence 


3867, Ap 


468.5 


15 


9 


519 




US-10-953 -349-3866 




3866, Ap 


468.5 


15 


9 


S63 


fi 


US-10-953 -349-3865 


Sequence 


3865, Ap 


463.5 


15 


a 


592 




US- 10-505*928-146 


Sequence 


146, App 


462 


15 


7 


587 


6 


US-10-953 -349-17119 


Sequence 


17119, A 


462 


15 


7 


674 


6 


US- 10-953-349-17118 


Sequence 


17118, A 


456 


15 


5 






US- 10-953-349-17120 






454 


is 


4 


434 


6 


US- 10 -44 9-902 -45070 


Sequence 


45070* A 


454 


IS 


4 


457 


6 


US- 10 -44 9- 902 -54251 


Seouence 


54251* A 


454 


15 


4 


465 


e 


US-10-953 -349-20724 


Seouence 


20724* A 


454 


15 


4 


483 


6 


US-10-953 -349-20723 


Sequence 


20723, A 


454 


15 


4 


503 


6 


US-10-953-349-20722 


Sequence 


20723, A 


453.5 


IS 


4 


542 


7 


US-11-090-6B9B-3 


Sequence 


3, Appli 


452 


15 


4 


782 


6 


US-10-449-902-42854 


Sequence 


42854, A 


451 


15 


3 


459 


6 


US-10-953-349-38583 


Sequence 


38583, A 


450 


15 


3 


4S1 


6 


US-10-9S3-349-365B4 


Sequence 


38S84, A 


450 


15 


3 


502 


6 


US-10-449-902-SS917 


Sequence 


55917, A 


450 


15 


3 


574 


6 


US-10-449-902-38154 


Sequence 


38154, A 


450 


15 


3 


574 


6 


US-10-449-902-50422 


Sequence 


S0422, A 


447. S 


15 


2 


344 


6 


US-10-505-928-387 


Sequence 


387. App 



RESULT 1 

US-10-449-902-53930 

; Sequence 53930, Application US/ 1044 9 90 2 
; Publication No. US20060123505A1 
j GENERAL INFORMATION: 

; APPLICANT: National Institute of Agrobiological Sciences. 

; APPLICANT: Bio-oriented Technology Research Advancement Institution. 

; APPLICANT: The Institute of Physical and Chemical Research. 

APPLICANT: Foundation for Advancement of International Science. 

TITLE OP INVENTION: FULL-LENGTH PLANT cDNA AND USES THEREOF 

FILE REFERENCE: MQA-A020SY1-US 

CURRENT APPLICATION NUMBER: US/10/449,902 

CURRENT FILING DATE: 2003-05-29 

PRIOR APPLICATION NUMBER: JP 2002-203269 

PRIOR FILING DATS: 2002-05-30 

PRIOR APPLICATION NUMBER: JP 2002-383870 



; PRIOR FILING DATE: 2002-12-11 
; NUMBER OF SEQ 10 NOS: 56791 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 53930 

LENGTH: 687 

TYPE: PRT 

ORGANISM: Oryza sativa 
US-10-449-902-53930 



Query Match 46.5%; 
Best Local Similarity 51.3%; 
Matches 278; Conservative 7 



Score 1367; DB 6; Length 687; 
Pred. No. 1.8e-60; 
I; Mismatches 120; Indels 70; 



Oy 


76 


Db 


141 


Oy 


136 


Db 


201 


Oy 


190 


Db 


261 


Oy 


240 


Db 


321 


Qy 


300 


Db 


381 


Qy 


358 


Db 


441 


Oy 


388 


Db 


501 


Oy 


437 


Db 


561 


Oy 


486 


Db 


621 


Oy 


546 


Db 


681 
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1 = 1 I 



-DCYGNYDNLLSQFGCMO 3B7 
II I 



-NI EQYIHDLDSNSFBLDLQPSBDBKR- 

= == I =1 = == ==l 



LIJ»BI*QAGGNPWHQPVE1<NLILK>1GPVDKRJ03LFARRRQL1XTEGPHLY 4BS 

= h = ll I II Hlllllll I 1 = 111111 = 111 Hllllllll 
561 KLPNYSPEQALIQLDRQARENKWHPFVENNLILiOWFVBKIUCGLPSRRRM 620 



llll I llllllllhlll 



=11 lillllllllll II I I II 1=1= 



Publication No. US20060123505A1 
GENERAL INFORMATION: 

APPLICANT: National Institute of Agrobiological Sciences. 

APPLICANT: Bio-oriented Technology Research Advancement Institution. 

APPLICANT: The Institute of Physical and Chemical Research. 

APPLICANT: Foundation for Advancement of International Science. 

TITLE OF INVENTION: FULL-LENGTH PLANT cDNA AND USES THEREOF 

FILE REFERENCE : MOA-A0205Y1-US 

CURRENT APPLICATION NUMBER: US/ 10/449, 902 

CURRENT FILING DATE: 2003-05-29 

PRIOR APPLICATION NUMBER: JP 2002-203269 

PRIOR FILING DATE: 2002-OS-30 

PRIOR APPLICATION NUMBER: JP 2002-383870 

PRIOR FILING DATE: 2002-12-11 

NUMBER OP SEQ ID NOS: 56791 

SOFTWARE: Patentln Ver. 2.1 
SEQ 10 NO 51984 
LENGTH: 498 
; TYPE : PRT 

; ORGANISM: Oryza sativa 
US-10-449-902-51984 

Query Match 27.6%; Score B12; DB 6; Length 498; 

Best Local Similarity 37.6%; Pred. No. 2e-33; 

Matches 198; Conservative 74; Mismatches 189; Indels 66; Gapi 



oy 



oy 

Db 

oy 
Db 
Oy 
Db 



50 AMDGTAAE P RPGAGSLQHAQP PPQ PRK - 
1=1 I I h I I I 

2 AVGGDDDME RD FAARLRLAPS PAS 1 



KRPB DFKFGKI LGEGS 92 

= 1= Ih HI I II 

XAFRAPQEOFTVGD FELG KI YGVGS 61 

93 FSTWLARBLATSRBYAI KI LEKRH 1 1 KENKVPYVTRERDVMSRLDHPFFVKLYPTFQDD 152 

= 1 II I" I Ihlh'h I llll- II II 1= =1111 ::|:||||| 
62 YSKVVRAK10XDTGNVYALKI MDKKF I TKENKI SYVI04BR I VLDQLDH PGV I RLPPTFQDT 121 

153 B KLYFGLSYAXNGELLKY I RKI GS FDETCTR FYTAE I VSALBYLHG KG 1 1 HRDLKPENI L 212 

II I =111 1=1 1 III llll lllll 1=1111-1111=1 
122 YSLYMALE S CEGGELFDQ I VRKDRLSEDBARFYAAE I VD I LBYLH5LGL I HRDVKP ENLL 1B1 

213 LNSDMHIQITDFGTAK VLSPESKQARANSFVGTAQYVSPBLLTEKSACKSSDL 265 

I I M l I 1= I = I I II =11111 II Ihl =11 

182 LTSDGH I KI ADSGSVKPTKDTP I KVLPNSTNERACTFVGTAAYVPPBVLNSAPPTFGNDL 241 

266 WALGCI I YQLVAGLPPFRAGNSYLI FQKI IKLBYDFPEKFFPiARDLVBKLLVLDATKRL 325 

lllll =lll==l 11= =1=1111=11 I II I llll==lll =1 =11 
242 WALGCTLYQLLSGSSPFKDASEWLI FQRI IARDLRIPEYFSDDARDLIDKLLDVDPSKRP 301 

326 GCEEMEGYGPLKAH P F PESVTVENm QQTPP KLTA YLPAMSEDO EDCYGNYDNLLSQFGC 385 

I =11 II llll = 1 = 1= HI = I = =111 = II I 

302 GAGP-DGYVSXJGQiPFFRGIDWIQilIRSTRAPKLA- -MEANANEDEDSQDS- -SWLSHMG- 355 



I' : I 



GLPQRSGSNI EQYI HOLDS- NSFBLDLQFSB 
I I I = =1 I I =11= 

O- -ASSSSBVRSHISRLASIDSFD- - - 



RESULT 2 

US-10-449-902-51984 

; Sequence 51984, Application US/10449902 



oy 

Db 



44S GNPWHQFVENN LILKMGP\TOXRfQQLFARRRQLLLTEGPHLYYVDPVNKVL103BIPW 500 

■ I hi II h = = l I == Ihlh I I III I II I I 

392 -SRWQDPLEPGESWLISKLKXINK LTNKKVQLILTOIO?QLICVDPGKMVTKGNIMW 447 



501 S---OHLRPEAKNFKTFFVHTPNRTYYLKDPSGNAHKWCRiaQHVVfR 544 

1 || : | | : || : | | :| : |::: | 

44S SDDPSBLWVQVSNSSHFRICTPKKVSSFEDAKQRAVQWKKAIEDLQR 494 



RESULT 3 

US-10-449-902-33335 

; Sequence 3333S, Application US/10449902 
; Publication No. US2006012350SA1 
GENERAL INFORMATION: 

APPLICANT: National Institute of Agrobiological Sciences. 

APPLICANT: Bio-oriented Technology Research Advancement Institution. 

APPLICANT: The Institute of Physical and Chemical Research. 

APPLICANT: Foundation for Advancement of International Science. 

TITLE OF INVENTION; FULL-LENGTH PLANT cDNA AND US 

FILE REFERENCE: NOA-A020SY1-US 

CURRENT APPLICATION NUMBER: US/10/449.902 

CURRENT FILING OATS : 2003-05-29 

PRIOR APPLICATION NUMBER: JP 2002-203269 

PRIOR FILING OATS: 2002-05-30 

PRIOR APPLICATION NUMBER: JP 2002- 3 B3 870 

PRIOR FILING DATE: 2002-12-11 

NUMBER OF SEQ 10 NOS: 56791 

SOFTWARE: Patent In Ver. 2.1 
SB0 ID NO 33335 
LENGTH: 417 
TYPE: PRT 
; ORGANISM: Oryza aativa 
US-10-449-902-33335 



Query Match 25.0%; 
Best Local Similarity 3B.7%; 
Matches 173; Conservative 6 



Score 736.5; DH 6; 
Pred. No. B.3e-30; 
>; Mismatches 160; 



Length 417; 

Indels 49; Gaps 



Oy 113 LBKRHI IKENKVPYVTRERDVMSRLOHPFFVKLYFTFODDBKLYFGLSYAKNGBLLKYIR 172 

I Mil: II II h HIM "I'lllM II I « III I 

Db 1 MDKKFITKENKISY\nOffiRIVlJ)QLDHPGVIRLFFTFQDTYSLYKAI^ESCEGGELFDQIV 60 

Oy 173 KIGSFDBTCTRFYTASIVSALEYLHGKGI IHRDLKPENILLNEDMHIQITDFGTAK 228 

Db 61 RKGRLSEDEARFYAAE I VD I LEYLHSLGLIHRDVKPENLLLTSDGH I KIADFGSVKPTKD 120 

0y 229 - - - VLSPESKQARANSFVGTAQYVSPEU,TEKSACKS5DLWALGCI I YQLVAGLPPFRAG 285 

: I I II 'Mill II Ihl MINIM :|||::| Ih 

Db 121 TPIKVLPNSTlffiRACTPVGTAAYVPPBVl^SAPPTFGNDLWA^ 180 

Oy 286 NEYLI FQKI I IO£YDFPEKFFPKARDLVEKLLVLOATKRLGCEEMEGYGPLKAHPFFESV 345 

MMIIIMI : II I Mil-Ill :| =11 I Ml II I II I i 
Db 181 S EWL I FOR 1 1 ARDLKI PBYFSDDARDL I D KLLDVDPS KRPGAG P - DG YVS LKKH P PFRG I 239 

Oy 346 TWENLH QQTP PKLTAYLPAMSEDDEDCYGNYDNLL5QFGCMQVS SSSS5H SLSA5DTG LP 405 

hh III = I = Mil . i II I h I I : I 

Db 240 DWKNIRSTRAPKLA- -MEANANEDEDSQDS- -SWLSHMG SAPVNQHVSPVGNDG- - 289 

Oy 406 QRSGSNIBOYIHDLDS-NSFBLDLOFSEDBKRLLLBKOAGGNPWHQFVBNN LILKM 460 

IM ' I II MM . I IM II h 



Db 


290 


Oy 


461 


Db 


330 


oy 


518 


Db 


387 


RESULT 4 




US-10-505 


-928 



290 ASS5SEVRSHISRLASIDSPD-- 



-SRWQDPLBPGESWLISKL 329 



I " IIMh I I III I II I II II . I I : 
-LTNKKVQLILTDKPCLICVDPOKMVTKCNIKWSDDPSBLNVOVSNSSHFRIC 386 



I hh h 



Sequence 845, Application US/10505928 

Publication No. US20060088532A1 

GENERAL INFORMATION: 
APPLICANT: Ludwig Institute for Cancer Research et al. 
; TITLE OF INVENTION: LYMPHATIC ENDOTHELIAL GENES 
; PILE REFERENCE : 28967/39178 
; CURRENT APPLICATION NUMBER: US/10/505. 928 
; CURRENT FILING OATS: 2004-08-27 
; PRIOR APPLICATION NUMBER: US 60/363,019 
; PRIOR FILING DATE: 2002-03-07 
t NUMBER OF SEQ ID NOS: 866 
; SOFTWARE: Patent In 3.2 
! SB0 ID NO 84 S 
; LENGTH: 431 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-S05-92B-64S 



Query Match 19.1%; 
Best Local Similarity 38.2%; 
Matches 120; Conservative 62; 



I M III I 



561; DB 6; Length 431; 
>d. No. 3.3o-21; 
Mismatches 112; Indels 20; 
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Db 
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oy 
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Db 
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oy 


165 


Db 
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oy 


22S 


Db 
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Oy 
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300 


Oy 
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I I II f.\* St MMI : 



1ATKRLGCSEMEGYGPLKAHPFPES 344 

II IM II I Mill : MM II 

- DDFMB I KSHVFFSL 357 



■ MM 



III 



358 INWDDLINKKITPP 371 



RESULT 5 

US-10-472-622A-2 

; Sequence 2, Application US/10472622A 

; Publication No. US20060127892A1 

; GENERAL INFORMATION: 

; APPLICANT: BUSJAHN, ANDREAS 

; APPLICANT: LUFT, FRIEDRICH C. 

j APPLICANT: LANG, FLO R IAN 

; TITLE OP INVENTION: QUANTITATIVE DIAGNOSTIC ANALYSIS OF HYPERTENSION 
; PILB REFERENCE: IBHWH-0016 

; CURRENT APPLICATION NUMBER: US/ 10/472 , 622A 

; CURRENT FILING DATE: 2003-09-22 

; PRIOR APPLICATION NUMBER: PCT/EP02/03180 

PRIOR PILING DATE: 2002-03-21 

NUMBER OF SEQ ID NOS: 16 

SOFTWARE: Patent In Ver. 3.3 
SEO ID NO 2 
; LENGTH: 431 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-472-622A-2 

Query Match 19.1%; Score 561; DB 6; Length 431; 

Best Local Similarity 38.2%; Pred. No. 3.3e-21j 

Matches 120; Conservative 62; Mismatches 112; Indels 20; Gapt 
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59 
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62 
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Db 
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165 


Db 


182 


oy 
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Db 


242 
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285 


Db 
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Db 
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M II I hMMII hill I 
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RESULT 6 

US-10-449-902-50842 

; Sequence 50842, Application US/10449902 



M II M -MM I I II -h 



Mill 



II IM II 



I Mill : 
3RTKRLGAK- 



: MM II 
-DDFMB I KSHVFFSL 357 



; Publication No. US20060123505A1 
; GENERAL INFORMATION: 

; APPLICANT: National Institute of Agrobiological Sciences. 

APPLICANT: Bio -oriented Technology Research Advancement Institution. 
APPLICANT: The Institute of Physical and Chemical Research. 
APPLICANT; Foundation for Advancement of International Science. 
TITLE OF INVENTION: FULL-LENGTH PLANT cDNA AND USES THEREOF 
FILE REFERENCE: MOA-A020SY1-US 
CURRENT APPLICATION NUMBER: US/ 10/44 9, 902 
CURRENT FILING DATE : 2003-05-29 
PRIOR APPLICATION NUMBER: JP 2002-303269 
PRIOR FILING DATE: 2002-05-30 
PRIOR APPLICATION NUMBER: JP 2002-383870 
PRIOR PILING DATE : 2002-12-11 
NUMBER OF SEQ ID NOS: 56791 
SOFTWARE: Patentln Ver. 2.1 
SEQ 10 NO 50842 

LENGTH: 480 

TYPE: PRT 

ORGANISM: Oryza sativa 
US-10-449-902-50842 



Query Match 19.0%; 
Best Local Similarity 37.8%; 
Matches 113; Conservative 6 



Score 558; DB 6; Length 480; 
Pred. No. 5.2o-21; 
); Mismatches 108; Indels 1 



Oy 


80 


Db 


149 


Oy 


140 


Db 


209 


Oy 


200 


Db 


269 


Oy 


260 


Db 


326 


Oy 


320 


Db 
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RESULT 7 

US-10-953-349-17750 

; Sequence 17750, Application US/10953349 
; Publication No. US20060107345A1 
; GENERAL INFORMATION: 

; APPLICANT: ALBXANDROV, Nickolai et al. 

; TITLE OP INVENTION: SEQUENCE- DETERMINED DNA FRAGMENTS AND CORRESPONDING 
POLYPEPTIDES 
; TITLE OP INVENTION: 
; FILE REFERENCE: 2750-1579PUS2 



; CURRENT APPLICATION NUMBER: US/10/ 9S3 , 349 

; CURRENT PILING DATE: 2004-09-30 

; NUMBER OP SEQ 10 NOS: 40253 

; SOFTWARE: Patentln version 3.3 

; SEQ ID NO 17750 

; LENGTH: 340 

; TYPB : PRT 

ORGANISM: Glycine max 
US-10-9S3-349-17750 



Query Match 18.7%; 
Best Local Similarity 39.1%; 
Matches 106; Conservative 6 



Score 551; DB 6; Length 340; 
Prod. No. 6.2o-21; 
I; Mismatches 100; Indels 
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RESULT B 

US-10-953-349-17749 

Sequence 17749, Application US/10953349 
Publication No. US20060107345A1 
GENERAL INFORMATION: 
APPLICANT: ALEXANDROV. Nickolai et al. 

TITLE OP INVENTION: SEQUENCE -DETERMINED DNA FRAGMENTS AND CORRESPONDING 
POLYPEPTIDES 

TITLE OF INVENTION: ENCONDSO THERBY 
FILE REFERENCE : 2750 -1S79PUS2 
CURRENT APPLICATION NUMBER: US/ 10/ 953, 349 
CURRENT FILING DATE: 2004-09-30 
NUMBER OF SEQ ID NOS: 40252 
SOFTWARE : Patent In version 3.3 
SSQ ID NO 17749 

LENGTH: 41B 

TYPB: PRT 

ORGANISM: Glycine max 
US-10-953-349-17749 



Query Match 

Best Local Similarity 



18.7%; 
39.1%; 



Score 551; DB 6; Length 418; 
Pred. No. le-20; 



Matches 106; Conservative 61; Mismatches 100; Indole 
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RESULT 9 

US-10-953-349-17748 

; Sequence 17748, Application US/10953349 
; Publication No. US20060107345A1 
; GENERAL INFORMATION: 

; APPLICANT: ALEXANDROV, Nickolai et al . 

; TITLE OF INVENTION: SEQUENCE -DETERMINED DNA 

POLYPBPTIDBS 

; TITLE OF INVENTION: BNCONDBD THERBY 

; PILE REFERENCE : 2750-1579PUS2 

! CURRENT APPLICATION NUMBER: US/ 10/ 953 , 349 

; CURRENT FILING DATE: 2004-09-30 

; NUMBER OF SEQ ID NOS: 402S2 

; SOFTWARE: Patent In version 3.3 

; SEQ ID NO 17748 

; LENGTH: 479 

; TYPE: PRT 

; ORGANISM: Glycine max 

US-10-9S3-349-1774B 



FRAGMENTS AND CORRESPONDING 



Query Match 18.7%; 
Best Local Similarity 39.1%; 
Matchea 106; Conservative 6 



Score 551; DB 6; Length 479; 
Pred. No. 1. le-20; 
.; Mismatches 100; Indole 



80 EDFKFGKI LGEGSFSTWLARELATSREYAI KI LEKRH 1 1 KENKVPYVTRERDVMSRLDH 139 
•II' h = hl'|i I h II Ihh: | |:::| |, |||: ::::| 
148 DDFE I LKVVGQGAPAKVYQVRKKGTSE I YAMKVMRKDKIMBKNHAEYMKAERD IWTKI EH 207 

140 P FF^j^YFTF^DH ^YFGl^AiOJGELLKY I RKI GSFDETCTRFYTAEI VSALEYLHGK 199 

208 P PWQLRYSFQTKVRLYLVIJ3 FVNQGHLFFQL YHQGLFREDLAR I YTAB I VSAVSHLHAN 267 

200 GIIHRDLKPENI LLNEDMH I Q I TDFGTAKVLS PES KQARANS FVGTAQYVS PELLTE KSA 259 
Ihllllllllllh I h Hill II : hll II i| = i|hi I 



Db 368 G I MHRDLKPEN I LLDADGHVMLTD FG LAKQP E SSTRSN5MOQTLEYMAPE 1 1 LGKGH 324 

Qy 260 CKSSDLHALGCI I YOLVAGLPPFRAGNEYLI FQKI I KLEYDFPEKFFPKARDLVEKLLVL 319 

l"l |M| :::::: | ||| || | llhl . I >| h= II 

Db 325 D KAADWWSVGVLLPEMLTGKP PFOGGNRD KI 0Q KI VKDKI KLPAPLS SEAH SLLKGLLQK 384 

Oy 320 DATKRLGCEEMEGYGPLKAHPPFESVTWENL 350 

■WW I >hl >|> i | I 
Db 3B5 EQARRLGCGP - RGVEB I KSHKWFKP I NWRKL 414 



RESULT 10 
US-10-505-928-658 

; Sequence 658, Application US/10505928 
; Publication No. US200600B8532A1 
; GENERAL INFORMATION: 

; APPLICANT: Ludwig Institute for Cancer Research et al. 

; TITLE OP INVENTION: LYMPHATIC ENDOTHELIAL GENES 

; FILE REFERENCE: 28967/39178 

; CURRENT APPLICATION NUMBER: US/10/505. 92B 

; CURRENT PILING DATE: 2004-06-27 

; PRIOR APPLICATION NUMBER: US 60/363.019 

; PRIOR FILING DATE: 2002-03-07 

; NUMBER OF SEQ ID NOS: 866 

; SOFTWARE : Patentln 3.2 

; SEQ ID NO 658 

; LENGTH: 351 

; TYPB: PRT 

; ORGANISM: Homo sapiens 

US-10-505-928-658 

Query Match 18.7%; Score 550.5; DB 6; Length 351; 

Best Local Similarity 37.0%; Pred. No. 9e-21; 

Matchea 111; Conservative 65; Mismatches 113; Indels 11; Gapa 5; 



Qy 80 ED FKFGKI LGEGS FSTVVLARELATSRSYA I KI LEKRH 1 1 KENKVPYVTRERDVMSRLDH 139 

llh I II III hi ■ II : Ihllhh ::| " h 
Db 42 EDFBRKKTLGTGS FGRVMLVKH KATSQYYAMKI U) KQKWKL ICQ I EHTLNB KR I LQAVNP 101 

Qy 140 P FFVKLYPTFQDDEKLYFGLSYAKNGSLLKY I RKI GS FDETCTRPYTAE I V SALBYLH G K 199 

II hi ■■ hh II : I Ih -hll | | III |:|| UK 
Db 102 PFLVRLEYAPICDNSNLYMVMEYVPGGEMFSHUUl IGRFSEPHARFYAAQI VLTFBYLHSL 161 

Oy 200 GI IHRDLKPENILLNEDMHIQ I TDFGTAKVLS PBS KQARANS PVGT^ 259 

:hllllllhh: :|hllll II = i I i II :|::||:: I 

Db 162 DLIYRDIJCPEN1XIDHQGYI0VTDFGFAKRV KGRTWTLCOTPBYLAPBIILSKGY 216 

Qy 260 CK5SDLMALGCI I YOLVAGLPPFRAGNEYLI FQKI I KLEYDPPEKFPPKARDLVEKLLVL 319 

h I llll =lh: II III I h = lh ■■ II I Mh II : 

Db 217 NKAVDWWALGVL I YBMAAGYP PPPADQP I QI YE KI VSGKVRPPSH FSSDLKDLLRNLLQV 276 

Oy 320 DATKRLGCE EMEGYG PL KAH P FFESVTWENLiH QQT P P KLTA - YL P AM - SEDD EDCYGNYD 377 

I III I I ! I I 1 1 i I nh h I ::| I : \ 

Db 277 OLTKRFG-NLKNGVSDIKTH1CWFATTDWIAIYQR- - -KVEAPFIPKPRGSGDTSNFDDYB 332 



US-10-449-902-37629 

; Sequence 37629. Application US/10449902 
; Publication No. US2OO6O12350SA1 
; GENERAL INFORMATION: 

; APPLICANT: National Institute of Agrobiological Sciences. 

; APPLICANT: Bio -oriented Technology Research Advancement Institution. 

; APPLICANT: The Institute of Physical and Chemical Research. 

; APPLICANT: Foundation for Advancement of International Science. 

; TITLE OF INVENTION: FULL-LENGTH PLANT cDNA AND USES THEREOF 

; FILE REFERENCE: MOA-A0205Y1-US 

; CURRENT APPLICATION NUMBER: US/10/449.903 

; CURRENT PILING DATE : 2003-05-29 

; PRIOR APPLICATION NUMBER: JP 2002-203269 

; PRIOR FILING DATE: 2002-05-30 

; PRIOR APPLICATION NUMBER: JP 2002-383870 

; PRIOR PILING DATE: 2002-12-11 

; NUMBER OF SEQ ID NOS: 56791 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 37629 

; LENGTH: 693 

; TYPB: PRT 

; ORGANISM: Oryza aativa 
US-10-449-902-37629 

Query Match 18.5%; Score 544.5; DB 6; Length 693; 

Best Local Similarity 31.6%; Pred. No. 3.4e-20; 

Matchea 129; Conservative 74; Mismatches 132; Indels 73; Gaps 10; 

Qy 71 PPQPRKKRP- - EDFKFGKI LGEGSFSTVVLARELATSREYAIKILEKRHI IKENKVPYVT 128 

llll Mh I i hi I Ih: I I I I I I >|:i| | : 
Db 266 PVHPVKDRTS I DDPEI I KPI SRGAFGRVFLAKKRTTGDLFAI KVLRKADMI RKNAVBS I L 325 

Qy 129 RBRDVMS RLDH PFFVKL Y FTFQDDE KL YFGLSYAKNGBLLKY I RKI GS FDBTCTRFYTAE 188 

llh: : ill h ml III : I hi U Mil II II 
Db 326 AERD I L I TVRNPFVVRFPYSFTSRENL YLVMEYUK3GDL YSLLRNIACLDEDVAR I YLAB 385 

Qy 189 IVSALEYLKGKGI IHRDLKPENILLNEDMHIQITDFGTAKV LS 231 

:| llllll hlllllhhh I Ihlllll ill II 
Db 386 WLALEYUiSMH I VHRD LKPDNLL I AKDGH I KLTDFGLS KVGL I NSTDDLSGPAVSGSSL 445 

Qy 232 PESKQ ARANSFVGTAOTVSPBLLTBKSACKSSDLWALGCI I YQLVA 377 

hi i I III h'lhl hi h:| huh 

Db 446 YGDDB PQMSE FEEMDHRARRQKRSAVGTPO YLAPB I LLGTGHGTSADWWSVGV I LFEL I V 505 

Qy 278 GLPPPRAGNEYLI FQKI I KLBYDP- - -PEK7FPKARDLVBKLLVLDATKRLGCEEMEGYG 334 

hill I i 11 h Ih i|ilh>lll I illl I 

Db 506 G I PPPNAEHPQTI PDN I LNRKI PWPHVPEEMSSBAQDL I DKLLTEDPHQRLGA NGVS 562 

Qy 335 PLICAHPPPBSVTWENIJIOQTPPKLTAYLP AMSBDOSDCYGNYDNL 379 

:| I Ih nh I • I h = l : = Ih I | 

Db 563 BVKQHQPPKD I SWDTLARQK AAPVPSSDSAFDTSYFTSRYSWNPSDENIYBAY- - - 615 

Qy 380 LSQFGCMQVS SSSSSH6 LSASDTOLPQRSGSN I BOY I HDLDSNS FEU) 437 

Db 616 BFBdIsDNOSLSgIsSCV SNKcWdMGDESSGPTBPE 651 



RESULT 11 



RESULT 13 

US-10-449-902-41431 

; Sequence 41431, Application US/ 10449903 
; Publication No. US20O60123S0SA1 
; GENERAL INFORMATION: 

; APPLICANT: National Institute of Agrobiological Sciences. 

; APPLICANT: Bio -oriented Technology Research Advancement Institution. 

; APPLICANT: The Institute of Physical and Chemical Research. 

; APPLICANT: Foundation for Advancement of International Science. 

; TITLE OF INVENTION: FULL-LENGTH PLANT cDNA AND USES THEREOF 

; FILE REFERENCE: MOA-A0205Y1-US 

; CURRENT APPLICATION NUMBER: US/10/449,902 

; CURRENT FILING DATE: 2003-05-29 

; PRIOR APPLICATION NUMBER: JP 2002-203269 

; PRIOR FILING DATE: 2002-05-30 

; PRIOR APPLICATION NUMBER : JP 2002-383870 

; PRIOR FILING DATE: 2002-12-11 

; NUMBER OF SEQ ID NOS: 56791 

; SOFTWARE : Patentln Vor. 2.1 

; SEQ ID NO 41431 

LENGTH: 580 
; TYPE : PRT 

; ORGANISM: Oryza sativa 
US-10-449-902-41431 

Query Natch 18.5%; Score 544; DB 6; Length 580; 

Best Local Similarity 34.2%; Prod. No. 3e-20; 

Matches 125; Conservative 72; Mismatches 135; Indela 34; Gaps B; 



Qy 76 KKRPEDFKFGKI LGEGSFSTVVLARELATSRSYAIKI LEKRHI IKENKVPYVTRERD'VMS 135 

I I = 11 = I II III 1 = !! I = = l = llhll II hi = II - 

Db 244 KVRLDDPELLKVLGRGSFGKVMQVKKKTTGKI FAMKI LKKRAI IARNQVEHTKAERKI LQ 303 

Qy 136 RLDH PFFVKLY FTFQDDE KLYFGLSYAKNGELLKY I RKIGSFDETCTRFYTAB I VSALEY 195 

I III : M II Mill I I : III >:: II I I II =11 i 
Db 304 AIXMPFLMTLRYAFQSIOSKLYFVUJYYQGGELFFKLKNKRRFGEDVARIY^ 363 

Qy 196 LHGKG 1 1 HRD LKP EN I LLNEDMH I Q I TD FGTAKVLS PES KQARANS FVGTAQYVS PELLT 255 

II hhllllllllll::: h Hill -1 = 1 I h = l II :|::||::| 

Db 364 LHS LGV I YRDLKPEN I LLDDNGHVCLTD FGLS KDVD PSDK AHTFCGTPBYLAP B I VT 420 

Qy 2S6 SKSACKSSDLWALGC I I YQLVAGLPPFRAGNEYLI FQKI I KLEYDFPEKFFPKARDLVEK 315 

h I hll ::hl hill « I «■ II II '■ h 

Db 421 GAGHDKAVDWWSLG I LLYELTVG I P P FYSQNVNEMYNX I QHGVLRF P PFLSEGCKSL I VA 480 

Qy 316 LLVLOATKRLGCEEMEGYGPLKAHPPPBSVTWENLHQ- - -QTPPKLTAYLPAMSEDDBDC 372 

II I Mil :hl lh : h : : : I | | i h 

Db 481 LLNRDPAKRLG--SGKDLDDVKSHAFFKDLNWDKMMRKEIEVP YKPKVKSGDDT- 532 

Oy 373 YGNYDNLLSQFGCMQV SS S SSSHS LSASDTGLPQR5GSN I EQY I HDLDSNS FELDLQFSE 432 

hi II i h : I I n|i I I i 

Db 533 -QNFDBTFTKEPV- -VDSHVPTSKLTETVTNEP DAFK-DFTFOQ 572 

Qy 433 DEKRLL 438 

= 1 =11 

Db 573 NESKLL 578 



RESULT 13 

US- 10 -44 9 -902 -44100 

j Sequence 44100, Application US/ 1044 9902 

; Publication No. US20060123505A1 

; GENERAL INFORMATION: 

; APPLICANT: National Institute of Agrobiological Sciences. 

; APPLICANT: Bio-oriented Technology Research Advancement Institution. 

; APPLICANT: The Institute of Physical and Chemical Research. 

; APPLICANT: Foundation for Advancement of International Science. 

J TITLE OF INVENTION: FULL-LENGTH PLANT cDNA AND USES THEREOF 

; FILE REFERENCE : MOA-A0205Y1-US 

j CURRENT APPLICATION NUMBER: US/10/449,902 

; CURRENT FILINO DATE: 2003-05-29 

; PRIOR APPLICATION NUMBER: JP 2002-203269 

; PRIOR FILING DATS: 2002-05-30 

; PRIOR APPLICATION NUMBER: JP 2002-383870 

; PRIOR FILING DATE: 2002-12-11 

; NUMBER OP SEQ ID NOS: 56791 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 44100 

; LENGTH: 485 

j TYPE: PRT 

; ORGANISM: Oryza sativa 
US-10 -449-902 -44100 

Query Match 18.3%; Score 537.5; DB 6; Length 485; 

Best Local Similarity 32.9%; Pred. No. 5.3e-20; 

Hatches 122; Conservative 72; Mismatches 138; Indela 39; Gaps 8; 

Qy 16 SSVVLCSCPSPSMVRTQTESSTPPGIPGGSRQGPAMD- - -OTAABPRPGAGSLQHAQPPP 72 

= hl I -II h I II I II 

Db 94 TSL VSQSLRLSNLTLNKTEGS SE PAEEKERNLGQLSDEE FDNATTBNEG I GL 145 

Qy 73 QPRKKRPEDFKFGKI LGEGSFSTWLARELATSREYAI KI LEKRHI I KENXVPYVTRERD 132 

= 11 = h=hhl I h II lhh = I h>l| h III 
Db 146 ODFEILKLVGQGAFGKVFQVRKKGTSE I YAMKVMRKDKI LBKNHAEYMKAERD 198 

Qy 133 VMSRLDH P FFVKL Y FTFQDDE KL YFGLSYAKNG E LLKY I RKI GS FDETCTRFYTAB I VSA 192 

'■'■'■'•-■WW hi »ll Ml I s II = 1111 I llilllll 

Db 199 I LTKVDHPFWQLRYSFQTKYRLYLVLDPINGGHLFFQLYKCGIJTIBELAR I YTAE I VSA 258 

Qy 193 LEYLHGKG 1 1 HRDLKPEN I LLNEDMH I Q I TD FGTAKVLS PBS KQARANS FVGTAQYVS PE 252 

« =11 Ihllllllllllh I I =1111 II hll II :|::|| 

Db 259 VAHLHDNG I MHRDLKPEN I LLDADGHAMLTDFGLAKE F NENTRSNSMCGTVEYMAPB 315 

Qy 253 LLTEKSACKSSDLMALGCI IYQLVAGLPPFRAGNEYLIFQKI IKLBYDFPEKFFPKARDL 312 

h = l h = l ■■■■'■■•■'■ I III II '■ 111 = 1 =1 =11 
Db 316 IILGRGHDKAADWWSVGILLFEMLTGKPPF-VGNRDKVQQKIVra 374 

Qy 313 VEKLLVLDATKRLGCEEMEOYGP LKAH P FFESVTWENLH - QQT P P KLTAYLPAMS 366 

= = II = II I I II =1 = = = = I I :| I 

Db 375 LKGLLHKEGGKRLGS G PGGSDE I KRYKNLKF I NWRKLEARQ I QPS FRPNVSGLT 428 

Qy 367 SDDEDCYGNYD 377 

I hi 

Db 429 CIANFD 434 



RESULT 14 

US-10-511-937-2450 

; Sequence 2450, Application US/10511937 

; Publication No. US20060088836A1 

; GENERAL INFORMATION: 

; APPLICANT: EXPRESSION DIAGNOSTICS, INC. 

; APPLICANT: Wohlgemuth, Jay 

; APPLICANT: Fry, Kirk 

; APPLICANT: Woodward, Robert 

; APPLICANT: Ly, Ngoc 

; APPLICANT: Prentice, James 

; APPLICANT: Morris, Mac Dona Id 

; APPLICANT: Rosenberg, Steven 

; TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR DIAGNOSING 

; TITLE OF INVENTION: AND MONITORING TRANSPLANT REJECTION 

; FILE REFERENCE: 506612000104 

; CURRENT APPLICATION NUMBER: US/10/511,937 

; CURRENT PILING DATE: 2004-10-19 

; PRIOR APPLICATION NUMBER: PCT/US2003/012946 

; PRIOR PILING DATE: 2003-04-24 

; PRIOR APPLICATION NUMBER: US 10/131,831 

; PRIOR FILING DATE: 2002-04-24 

; PRIOR APPLICATION NUMBER: US 10/325,899 

; PRIOR FILING DATE: 2002-12-20 

; NUMBER OF SEQ ID NOS: 3117 

; SOFTWARE: Patentln version 3.2 

; SEQ ID NO 2450 

; LENGTH : 706 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-511-937-2450 



Query Match 17.9%; Score 526.5; DB 6; Length 706; 

Best Local Similarity 31.5%; Pred. No. 2.6e-19; 

Matches 139; Conservative 80; Mismatches 149; Indela 73; Gaps 19; 



oy 


3 


RTTSOLYDAVP I QSSWLCSCPSPSMVRTQTESSTPPG I PGGS RQG PAMDGTAAB - PRPO 6 1 
|||:: |:: : | :: : > || :| :: | | : | | 
ROTEQIFREGPVBIGL PCSIKNB- - -ARPPCLPTPGKREP- -QOISWBSPLDB 352 


Db 


305 


oy 
Db 


= 1 1 h hll III hlhlll 1 II h= =111 1 
353 VDKMCHL-PEPBLNKERPSLQI KLKIEDPI I>JKMI>GKGSFGKVFLAEPKKTNQPFAI KAL 411 


oy 


114 


EKRHI IKEJJKVPYVTRERDVNS-RI^PPFVTCLYPTFOD 172 
= 1 == 1 h 1 = 1 =111 == III 1 hi : 1 hh =h 
KKDWLMDDDVECTMVE KRVLSLAWEHP FLTHMPCrFOTKENLFFVHEYLMGGDLMYH I Q 471 


Db 


412 


Qy 
Db 


173 
472 


KIGSFDBTCTRPYTABIVSALSYLHGHGI IHROLKPENILLNSDMHIQITOFOTAXVLSP 232 

II = II llh h=ll Ill-Mil :Mlh = l Ihl III 1 
SCHKFDLSRATFYAAB 1 1 IX3LQFLHSHG I VYRDLKLDNI LLDKDGH I KIADFGMCX 527 


oy 

Db 


233 
528 


ESK- •QARANSFVGTAQYVSPELLTBKSACXSSDLHALGCI 1 YQLVAGLPPFRAGNEYLI 2 90 
h h hi II l = =lhl = 1 1 h 1 1 II =1 = 
ENMLGDAKTNT FOGTPDY I APE I LLGQKYNHSVDWWS FGVLLYEML I GQS PFHGQDEBEL 587 



Qy 291 FQKI IKLEYDFPEKFFPK ARDLVS KLLVLDATKRLGCEBMEGYGPLKAH PFFESV 345 

II II hh hlh II I = Mil I := II I = 

Db 588 FHS I RMDNP - - FYPRWLEKEAKDLLVKLFVREPEKRLGVR GD I RQHPLFRE I 637 

Qy 346 TWENLHQQ- -TPPJOliTAYLPAMSEDDEDCYGNYDNLLSQFGCMQVSSSSSSHSLSASDTG 403 

II I := II = I = II hi II :| 

Db 638 NWEELERKE I DPP FRPKV-KSPFDC-SNFDK BFLNEKPRLSFADRA 681 

Oy 404 LPQRSGSNIEQYIHDLDSNSF 424 

I h :| I I 

Db 682 L INSMDQNMP 691 



RESULT IS 

US-10-449-902- 54017 

; Sequence 54017, Application US/10449902 
; Publication No. US20060123S05A1 
; GENERAL INFORMATION: 

; APPLICANT: National Institute of Agrobiological Sciences. 

; APPLICANT: Bio-oriented Technology Research Advancement Institution. 

; APPLICANT: The Institute of Physical and Chemical Research. 

; APPLICANT: Foundation for Advancement of International Science. 

; TITLE OF INVENTION: FULL-LENGTH PLANT cDNA AND USES THEREOF 

; FILE REFERENCE: MOA-A0205Y1 -US 

; CURRENT APPLICATION NUMBER: US/ 10/449, 902 

; CURRENT FILING DATE: 2003-05-29 

; PRIOR APPLICATION NUMBER: JP 2002-203269 

; PRIOR FILING DATE: 2002-05-30 

; PRIOR APPLICATION NUMBER: JP 2002-383870 

; PRIOR FILING DATE: 2002-12-11 

; NUMBER OF SEQ ID NOS: 56791 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO S4017 

; LENGTH: S56 

; TYPE: PRT 

; ORGANISM: Oryza sativa 
US-10-449-902-54017 

Query Match 17.4%; Score 512.5; DB 6; Length 556; 

Best Local Similarity 30.6%; Pred. No. le-18; 

Matches 132; Conservative 79; Mismatches 151; Indela 69; Gaps 12; 

Qy 73 QPRKKRPEDFKFCKII^EGSFSTVVLAREIATSRBYAIKJLBKRHIIKENKVPYVTRB 132 

107 CJfflKMOVroFELLTi^ !66 

Qy 133 VMSRliDHPFFViajYFTFi^DHKLYFGI>£rfAKlJSELLICYI RKIG5FDSTCTRFYTAEI VSA 192 

=1 I HlhHIhl II = I h = = = = « I III II I I 
Db 167 LLAEVDSAP I VKL YYSFQDSBYL YL I MEYLPGGDMMTUMRKDTLTEDBARFY IABTVLA 226 

Qy 193 LBYLHGKDI IHRDLKPENILLNEDMHIQITDPGTAKVLS 231 

= 1 =1 lllhlhhlh h -III | | 

Db 227 I ES IHKHSY I HRD I KPI3^^AlI^RSaHLKl£DFaLCKPLi)SSN7PNlJ^BPDYTSTKGTKPL 286 

Qy 232 PES KQARANS FVGTAQYVS PELLTE KSACKSSDLMAL 268 

hi » I I III h:|hl =1 | hi 



Db 


287 


Oy 


269 


Db 


347 


oy 


325 


Db 


406 


Oy 


364 


□b 


455 


oy 


437 


Db 


514 



287 PDSS SRLSSSALKRTQQEQLSHWOKNRRXLAYSTVOTPD Y I AS> EVLLKMC3YGMECDWWSL 346 

GCI I YQLVAGLP PFRAGNSYL I PQKI I - - KLBYDFPB - - KFFPKARDLVEKLLVLCATKR 324 

I hl««s I III : >||: III I hhlh III = =1 
GA I HYEKLVOYP P FYS EDPMSTCRKI VNWRSH L KF P EEAJCLS P EAKD L 1 5KLL - CNVEQR 405 

I/}CEEMBGYGPLf^PPPSSVrWSNIiIQQTPPKLTAYLPAM-SSOD&Z>CYGMYDNIXSQF 383 

II :| OHM « II hi l = = l = II I 

U3 TKCAHS I KAH PW PRGLPWERL YQHK AAF1PBVNSBLDTQNFBKFB ST 454 

OCMQVSSSSSS H S LSASDTG LPQRSGSN I BOY I HDLDSNSFBLDLQFSEDEK - - - R 436 

I h III I I i I =11 I : II i h =: 

GA-QIQSSSKSC 

LLLEKQAGGNP 
I I I 

524 



Search completed: June 27, 2006, 01:52:10 
Job time : 20 aeca 
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